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Some topics pertaining to algebras of linear 

operators 

Stephen Semmes 


Abstract 

Here we consider finite-dimensional vector spaces and linear op¬ 
erators on them. Some of the basic ingredients will be motivated by 
♦-algebras of linear transformations on inner product spaces, as in 
| Arv , iDo^ • In particular, let us note that in the theory of C* and 
Von Neumann algebras, the “double commutant” of a *-algebra is a 
kind of basic closure of the algebra. 

A special case concerns algebras of operators generated by repre¬ 
sentations of finite groups [ Bui , Cur2 , Ser2[. The use of characters is 
included in each of these three books, and is generally avoided in the 
present article. 

Part of the point of view about the p-adic absolute value and p-adic 
numbers that we follow here is that they can be interesting in con¬ 
nection with theory of computation and related settings just because 
of their nice properties and simplicity, aside from more traditional 
number-theoretic uses, even if those are also very interesting. 
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1 Preliminaries 

As usual, the real and complex numbers are denoted R and C, respectively. 

1.1 Finite groups 

Normally groups in this article will be finite. 

A finite group is a finite set G with a choice of identity element e in G 
and a binary operation G x G —> G, (x, y) t —> xy, such that xe = ex = x for 
all x 6 G, every element x of G has an inverse x -1 , which satisfies 

(1.1) xx~ l = x _1 x = e, 
and the group operation is associative, so that 

(1.2) x(yz) = ( xy)z 
for all x, y , and z in G. 

Two elements x, y of G are said to commute if 

(1.3) xy = yx. 

The group G is said to be commutative, or abelian , if every pair of elements 
in G commutes. 

A subset H of G is a subgroup if e e H, and if x € H and xy e H 
whenever x,y G H. 

If G and L are groups and / : G —> L is a mapping between them, then 
/ is said to be a homomorphism if 

(1-4) f(xy) = f(x) f(y) 

for all x, y in G. Note that / automatically maps the identity element of G 
to the identity element of L in this case, and that 

(1.5) f^ 1 ) = fix) -1 

for all x in G. An isomorphism between two groups is a homomorphism that 
is one-to-one and maps the first group onto the second one. In other words, 
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the homomorphism should be invertible as a mapping, and one can check 
that the inverse mapping is a homomorphism too. 

If / : G —> L is a homomorphism between groups, then the kernel of f 
is the subset K of G consisting of the elements x of G such that f(x) is the 
identity element of L. It is easy to see that K is a subgroup of G. 

A subgroup N of G is said to be normal if xyx~ l G N whenever y G N 
and x G G. The kernel of a homomorphism between groups is clearly a 
normal subgroup. Conversely, if N is a normal subgroup of G, then N is the 
kernel of a homomorhism. This follows from the standard quotient subgroup 
construction. 

1.2 Vector spaces 

We make the convention that a vector space means a vector space defined 
over some held k which has positive finite dimension (unless the contrary is 
indicated). If V and W are vector spaces with the same scalar held k, then we 
let C(V, W) denote the set of linear mappings from V to W. Thus £(V, W) 
is also a vector space over k, using ordinary addition of linear operators and 
multiplication of them by scalars. The dimension of £(V, W) is equal to the 
product of the dimensions of V and W. 

For V = W we simply write C(V) for C(V,V). For each Li,L 2 G £(V), 
the composition L x o L 2 , (Zq o L 2 ){v) = Zq(Z/ 2 (v)) for all v G V, is also a 
linear mapping on V, and hence an element of C{V). We sometimes write 
this simply as LiL 2 . This defines a product operation on Ciy), which is 
not commutative in general, but which does satisfy the associative law and 
distributive laws relative to addition and scalar multiplication. The identity 
mapping / or Iy on V, which takes every element of V to itself, is linear and 
is the identity element of C(V) with respect to this product. 

Let us write GL(V) for the set of linear transformations on G which are 
invertible. (If a linear mapping is invertible simply as a mapping, then the 
inverse mapping is automatically linear as well.) Thus GL(V ) is a group 
which is infinite. 

Suppose that iq,..., v n is a basis for the vector space V with scalar held 
k. If T is any linear transformation on V, then there is a unique n x n matrix 
(tj t k) with entries in k such that 

n n n 

(1.6) T(y OtlVi ) = ££ tj,k Vj 

1=1 j =1 k =1 
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for all Q!i,..., a n in k. Conversely, if (tj^) is an n x n matrix with entries i 


m 


k, then (|1.6|) defines a linear transformation T on V. 

As usual, if S and T are linear transformations on V corresponding to 
n x n matrices {sj.k) and (fy m ) (with respect to the basis Ui,... ,v n ), then 
S + T corresponds to the sum of the matrices, given by (syfc + tyfc)- If 7 is any 
scalar, then the matrix of 7 T is (7 tyfc)- The composition S o T has matrix 


Uj m ) given by the classical matrix product of 


Sj, k ) and (t I>r 


namely, 


(1.7) 


n 

U j,m = S J 
k =1 


k tk,r 


The matrix associated to the identity transformation / is {Sj.k ), where Sj : k = 1 
when j — k and 5j^ = 0 when j 7 ^ k. It is easy to check directly that the 
matrix product of with another matrix (in either order) is always equal 
to the other matrix. 


1.3 Representations of finite groups 

Let G be a group and let V be a vector space. A representation of G on V 
is a mapping p from G to invertible linear transformations on V such that 


( 1 . 8 ) 


Pxy Px 0 Py 


for all x and y in G. Here we use p x to denote the invertible linear transfor¬ 
mation on V associated to x in G, so that we may write p x (v) for the image 
of a vector v G V under p x . As a result of (| 1 . 8 p , we have that 


(1.9) 


Pe ~ I j 


where / denotes the identity transformation on V, and 

( 1 - 10 ) p x -1 = (p *) -1 


for all x in G. 

In other words, a representation of G on V is a homomorphism from G 
into GL(V). The dimension of V is called the degree of the representation. 

Basic examples of representations are the left regular representation and 
right regular representation over a field k, defined as follows. We take V to 
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be the vector space of functions on G with values in k. For the left regular 
representation, we define L x : V —> V for each x in G by 

(1-11) L x (f)(z) = f(x _1 z) 

for each function f(z) in V. For the right regular representation, we define 
R x : V —» V for each x in G by 

(1.12) R x (f)(z) — f(zx) 

for each function f(z) in V. Thus if x and y are elements of G, then 

(1.13) (L, o L„)(f)(z) = L,(L,(f))(z) = (L s (f))(x- 1 z) 

= f(y~ 1 x~ l z) = f((xy)~ 1 z) = L xy (f)(z), 

and 

(1.14) (R, o Ry)(f)(z) = R x (Ry(f))(z) = (Ry(f))(zx) 

= f(zxy) = R v (f)(z). 

Another description of these representations which can be convenient is 
the following. For each w in G, define the function <j) w (z) on G by 

(1.15) 4> w (z) = 1 when z — w, 4>w( z ) — 0 when z ^ w. 

Thus the functions (f) w for w in G form a basis for the space of functions on 
G. One can check that 

(1.16) L x ((j) w ^ (frxwi Rxi^frw) 0UIX -1 

for all x in G. 

Observe that 

(1.17) L x o R y = R y o L x 
for all x and y in G. 

More generally, suppose that we have a homomorphism from the group 
G to the group of permutations on a nonempty finite set E. That is, suppose 
that for each x in G we have a permutation n x on E, i.e., a one-to-one 
mapping from E onto E, such that 

(1.18) 7T X O 7 Ty TTry 
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As usual, this implies that 7r e is the identity mapping on E, and that n x -i 
is the inverse mapping of tt x on E. Let V be the vector space of fc-valued 
functions on E. Then we get a representation of G on V by associating to 
each x in G the linear mapping LL^ : V —> V defined by 

(1-19) n x (/)(a) = f(n x -i(a)) 

for every function /(a) in V. This is called the permutation representation 
corresponding to the homomorphism x i—> n x from G to permutations on E. 
It is indeed a representation, because for each x and y in G and each function 
f(a) in V we have that 

(1.20) (n*on„)(/)(a) = n > (n„(/))(a) = (n s (/))( 7 r I -.(a)) 

= /(ir„-i(uvi(a))) = /(ir (ls) -i(o)). 

Alternatively, for each b e E one can define ^(a) to be the function on E 
defined by 

(1.21) fpb( a ) — 1 when a — b, t/>b( a ) — 0 when a ^ b. 

Then the collection of functions ipb for b E E is a basis for V, and 

(1-22) n ; c (ip b ) = 

for all x in G and b in E. 

Suppose that Vi and V 2 are vector spaces over the same field k, and that 
T is a linear isomorphism from Vi onto V 2 . Assume also that p 1 and p 2 are 
representations of a group G on Vi and V 2 , respectively. If 

(1.23) Topl = p 2 x oT 

for all x in G, then we say that T determines an isomorphism between the 
representations p 1 and p 2 . We may also say that p 1 and p 2 are isomor¬ 
phic group representations, without referring to T specifically. Of course 
isomorphic representations have equal degrees, but the converse is not true 
in general. 

For example, let Vi = V 2 be the vector space of fc-valued functions on G, 
and define T on Vi = V 2 by T(f)(a) = /(a^ 1 ). This is a one-to-one linear 
mapping from the space of fc-valued functions on G onto itself, and 

(1.24) T o R x = L x o T 
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for all x in G. For if /(a) is a function on G, then 


(1.25) (ToR x )(f)( a ) 


T(Rx(f))(a) = -R :r (/)(a -1 ) 

/ (a -1 a;) = T(f)(x~ 1 a) 
L x {T(f))(a) — (L x o T)(f)(a). 


Therefore the left and right regular representations of G are isomorphic to 
each other, in either the real or complex case. 

Now suppose that G is a group and p is a representation of G on a vector 
space V over the field k, and that Vi,... ,v n is a basis of V. For each x in G 
we can associate to p x an n x n invertible matrix with entries in k using this 
basis, as in Subsection |1 . 2| . We denote this matrix by M x . The composition 
rule (|1.8|) can be rewritten as 


(1.26) 


M xy = M x 


M 


v> 


where the matrix product is used on the right side of the equation. 

A different choice of basis for V will lead to a different mapping x N x 
from G to invertible n x n matrices. However, the two mappings x t —> M x , 
x t —¥ N x will be similar , in the sense that there is an invertible n x n matrix 
S with entries in k such that 


(1.27) 


N„ = SM X S 


-1 


for all x in G. That is, S is the matrix corresponding to the change of basis. 
On the other hand, we can start with a mapping x i—► M x from G into 


invertible n x n matrices with entries in k which satisfies ( |1.26|) , and convert it 
into a representation on an n-dimensional vector space over k by reversing the 
process. That is, one chooses a basis for the vector space, and then converts 
nxn matrices into linear transformations on the vector space using the basis 
to get the representation. One might as well take the vector space to be k n : 
the space of n-tuples with coordinates in k (using coordinatewise addition 
and scalar multiplication), and the basis Vi,... ,v n to be the standard basis, 
where v t has ith coordinate 1 and all others 0. In this way one gets the usual 
correspondence between n x n matrices and linear transformations on k n . 

If one has two representations of G on vector spaces V\ , V 2 with the same 
scalar field k, then these two representations are isomorphic if and only if 
the associated mappings from G to invertible matrices as above, using any 
choices of bases on V\ and V 2 , are similar, with the similarity matrix S having 
entries in k. 






1.4 Reducibility 

Let G be a finite group, V a vector space over a field k, and p a representation 
of G on V. Suppose that there is a vector subspace W of V such that 

(1.28) p x (W) C W 

for all x in G. This is equivalent to saying that 


(1.29) 


PA W) = W 


for all x in G, since one could apply ( |1.28|) also to p x -1 = (p®) -1 . We say 
that W is invariant or stable under the representation p. 

Recall that a subspace Z of V is said to be a complement of a subspace 


W if 

(1.30) wnz = {0}, W + Z = R. 


Here W + Z denotes the span of W and Z, which is the subspace of V 
consisting of vectors of the form w + z, w E W, z E Z. The conditions (|1.30|) 
are equivalent to saying that 


(1.31) every vector v G V can be written in a 

unique way as w + z, w G W, z G Z. 


Complementary subspaces always exist, because a basis for a vector subspace 
of a vector space can be enlarged to a basis of the whole vector space. 

If W, Z are complementary subspace of a vector space V, then we get 
a linear mapping P on V which is the projection of V onto W along Z and 
which is defined by 

(1.32) P(w + z) = w for all w G W, z E Z. 

Thus I — P is the projection of V onto Z along W, where / denotes the 
identity transformation on V. 

Note that P 2 = P, when P is a projection. Conversely, if P is a linear 
operator on V such that P 2 = P, then P is the projection of V onto the 
subspace of V which is the image of P along the subspace of V which is the 
kernel of P. 
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1.5 Reductibility, continued 

Again let G be a finite group, V a vector space over a field k, p a represen¬ 
tation of G on V, and W a subspace of V which is invariant under p. In this 
subsection we assume that either 

(1.33) k has characteristic 0 
or 

(1.34) k has positive characteristic and the number of elements 
of G is not divisible by the characteristic of k. 

Let us show that there is a subspace Z of V such that Z is a complement 
of W and Z is also invariant under the representation p of G on V. To do 
this, we start with any complement Z 0 of IT in V, and we let P 0 : V —> V be 
the projection of V onto W along Z 0 . Thus P 0 maps V to IT, and P 0 (w) = w 
for all w G W. 

Let m denote the number of elements of G. Define a linear mapping 
P : V -> V by 

(1-35) P = — Px ° Po ° {PxY 1 - 

m x£G 

Our assumption on k implies that 1/m makes sense as an element of k, 
i.e., as the multiplicative inverse of a sum of m l’s in k, where 1 refers to 
the multiplicative identity element of k. This expression defines a linear 
mapping on V, because Po and the p x ’s are. We actually have that P maps 
V to W, because Po maps V to W, and because the p x ’s map W to W, 
by hypothesis. If w £ IT, then (p x ) _1 (tc) G IT for all x in G, and then 
Po(iPx)- 1 (w)) = (p x )~ 1 (w). Thus we conclude that 

(1.36) P(w) = w for all w G IT, 

by the definition of P. 

The definition of P also implies that 

(1.37) p y o P o (py)- 1 = P 
for all y in G. Indeed, 

(1.38) pyoPo(py)- 1 = — p y ° Px o P 0 o (p^ 1 o (py)- 1 

171 x£G 
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= ^ E/v ° p o ° (/v) 1 

' n xGG 

= —J2px°Po° ( Px)~ l = P- 

m x£G 

This would work as well for other linear transformations instead of P 0 as the 
initial input, but a subtlety is that in general one can get the zero operator 
after taking the sum. The remarks in the preceding paragraph ensure that 
this does not happen here, except in the degenerate situation where W = {0}. 

Because P(V) C W and P(w) = w for all w G W, P is the projection of 
V onto W along some subspace Z of V. Specifically, one should take Z to 
be the kernel of P. It is easy to see that W D Z = {0}, since P(w) = w for 
all w G W. On the other hand, if v is any element of V, then we can write 
v as P(v) + (v — P(v)). We have that P(v) G W, and that 

(1.39) P(v - P(v)) = P(v) - P(P(v)) = P(v) - P(v) = 0, 


where the second equality uses the fact that P(v) G W. Thus v — P(v) lies 
in Z, the kernel of P. This shows that W and Z satisfy (|1.30|) , so that Z is 
a complement of W in V. The invariance of Z under the representation p 
follows from (|1.37|) . 

Thus the representation p of G on V is isomorphic to the direct sum of 
the representations of G on W and Z that are the restrictions of p to W and 
Z. 


There can be smaller invariant subspaces within these invariant subspaces, 
so that one can repeat the process. Before addressing this, let us introduce 
some terminology. We say that subspaces W\, W 2 , ..., Wh of V form an 
independent system if W 3 ^ {0} for each j and if Wj G W 3 , 1 < j < h, and 


(1.40) 


i>i=o 

3 = 1 


imply 

(1.41) Wj = 0, j = 1,2,..., h. 

If, in addition, span(Wi,..., Wh) = V, then every vector v in V can be 
written in a unique way as x u j with Uj G Wj for each j. 


Definition 1.42 Let G be a finite group, U be a vector space, and a be a 
representation of G on U. We say that a is irreducible if there are no vector 
subspaces of U which are invariant under a except for {0} and U itself. 
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Lemma 1.43 Suppose that G is a finite group, V is a vector space over a 
field k which satisfies ( \1.33j ) or (\ 1.34) , an d p is a representation of G onV. 
Then there is an independent system of subspaces W \,..., Wh ofV such that 
span(VLi,..., Wh) = V, each Wj is invariant under p, and the restriction of 
p to each Wj is an irreducible representation of G. 


In other words, the representation p of G on V is isomorphic to a di¬ 
rect sum of irreducible representations of G (which are restrictions of p to 
subspaces of V). It is not hard to prove the lemma, by repeatedly finding 
invariant complements for invariant subspaces, until one reaches subspaces 
to which the restriction of p is irreducible. More precisely, suppose that one 
has an independent system of subspaces of V whose span is V and for which 
each subspace in the system is invariant under p. Initially this system could 
consist of V by itself. If the restriction of p to each subspace in the system 
is irreducible, then we have what we want for the lemma. Otherwise, for 
each subspace where the restriction of p is not irreducible, there is a pair 
of nontrivial complementary subspaces in that subspace which are invariant 
under p. We can use these complementary subspaces in place of the sub¬ 
space in which they were found, and this leads to a new independent system 
of invariant subspaces of V whose span is all of V. (It is not hard to check 
that the new collection of subspaces of V still forms an indepedent system.) 
By repeating this process, we can get irreducibility, as in the lemma. 


Remark 1.44 Exercise 16.7 on pl36 of [|Scr2|| discusses examples where rep¬ 
resentations over fields of positive characteristic (dividing the order of the 
group) do not “come from” representations over fields of characteristic 0. 


1.6 Positive elements and symmetric fields 

Let k 0 be a held of characteristic 0. We say that a subset A of k 0 is a set 
of positive elements if (i) 0 does not lie in A, (ii) x + y and xy lie in A 
whenever x,y G A, and (iii) w 2 lies in A whenever w is a nonzero element 
of k(). Observe that the multiplicative identity element 1 in ko lies in A by 
(iii), and that —1 does not lie in A, since otherwise one could use (ii) to get 
that 0 is in A. Any sum of l’s lies in A, so that k 0 must have characteristic 
0 in order for a set A of positive elements to exist. If x is an element of 
A, then 1 fx lies in A, because x 0 by (i), 1/x 2 G A by (iii), and hence 
1/x — x ( 1/x 2 ) is in A by (ii). 
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A field k of characteristic 0 is said to be a symmetric field if the following 
conditions are satisfied. First, we ask that k be equipped with an automor¬ 
phism called conjugation which is an involution, i.e., the conjugate of the 
conjugate of an element x of k is equal to x. The conjugate of x G k is 
denoted x, and we write k 0 for the subheld of k consisting of elements x such 
that x — x. The second condition is that k 0 be equipped with a set A of 
positive elements. (Thus A and the conjugation automorphism are part of 
the data for a symmetric held.) The last condition is that xx G A for all 
nonzero elements x of k. 

It may be that the conjugation automorphism is equal to the identity, so 
that k = k 0 . In this case A; is a symmetric held if it is equipped with a set of 
positive elements. 

Of course any subheld of the held of real numbers has a natural set of 
positive elements, namely the elements that are positive in the usual sense. 
A basic class of symmetric helds are subhelds of the complex numbers which 
are invariant under complex conjugation, using complex conjugation as the 
conjugation automorphism on the held. In this case the subheld of elements 
hzed by the conjugation is the subheld of real numbers in the held, and for 
the set of positive elements we again use the elements of the held which are 
positive real numbers in the usual sense. 

1.7 Inner product spaces 

Let k be a symmetric field, and suppose that V is a vector space over k. 
A function (v, w) on V x V with values in k is an inner product on V if it 
satishes the following three conditions. First, for each w G V, the function 

(1.45) v h->• ( v , w) 
is linear. Second, 

(1.46) (w,v) — (v,w) 

for all v,w G V, where x denotes the conjugate of an element x of k, as in 
Subsection H3- Third, if v is a nonzero vector in V, then (v, v) lies in the 
set of positive elements associated to k as in Subsection [U], and is nonzero 
in particular. Of course the second condition implies that (v, v) lies in the 
subheld k 0 of k consisting of x such that x — x. 

A vector space equipped with an inner product is called an inner product 
space. It is easy to see that there are plenty of inner products on a vector 
space over a symmetric held, by writing them down explicitly using a basis. 
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Suppose that (•, •) is an inner product on the vector space V over the 
symmetric held k. Two vectors u,w G V are said to be orthogonal if 


(1.47) 


(u, w) = 0 . 


A collection of vectors Vi, ..., v m in V is said to be orthogonal if v 3 and Vi 
are orthogonal when j ^ l. It is easy to see that any orthogonal collection 
of nonzero vectors in V is linearly independent. A collection Z 1} ..., Z r of 
vector subspaces of V is said to be orthogonal if any vectors in Z 3 and Z p , 
j 7 ^ p, 1 < j,p < r, are orthogonal. 

Assume that u\,... ,u m are nonzero orthogonal vectors in V, and let U 
denote their span. For each w in U, we have that 


(1.48) 


m 

w = J2 

3 =1 


{w,u 3 ) __ 

/ \ ”7 

\Uj,Uj) 


In other words, w is some linear combination of the u/ s, and then the inner 
product and the assumption of orthogonality can be used to determine the 
coefficients of the Uj' s, as in the preceding formula. Define a linear operator 
P on V by 


(1.49) 


m 

P(v) = E 


3 = 1 


( V i U i) .. 
/ \ 
\Uj, Uj) 


Then P(v) lies in U for all v in V, P(w ) = w for all w in U, and 


(1.50) (. P(v),w) = (v,w) for all v G V and w EU. 


This last condition is equivalent to saying that v — P(v ) is orthogonal to 
every element of U. 


Remark 1.51 The operator P : V — > V is characterized by the requirements 
that P(v) lie in U and v — P(v) be orthogonal to every element of U for all 
v in V. Specifically, if v is a vector in V and z i and Z 2 are two elements 
of U such that v — z\ and v — Z 2 are orthogonal to all elements of U, then 
z\ — Z 2 - This is because z\ — Z 2 lies in U, and z\ — Z 2 = (z\ — v) + (v — z 2 ) is 
orthogonal to all elements of U, so that z\ — Z 2 is orthogonal to itself. As a 
consequence, P does not depend on the choice of orthogonal basis for U. 


Lemma 1.52 Ifv i,..., v m are linearly independent vectors in V, then there 
are nonzero orthogonal vectors u\, ..., u m in V such that the span ofv i,..., v m 
is equal to the span of iq,. .., u m . 
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To prove this, one can argue by induction. The m = 1 case is trivial, and 
so we suppose that the statement is true for some m and try to establish it 
for m + 1. Let V\,... ,v m+ i be a set of m + 1 linearly-independent vectors 
in V. By the induction hypothesis, there are nonzero orthogonal vectors 
U\, ... ,u m such that the span of Vi,... , v rn is equal to the span of Mi, ..., u m . 
Let P be the projection onto the span of Ui,... ,u m defined above, and set 
u m +1 = v m+ i — P{v m+ 1 ). Then u m+ 1 ^ 0, because u m +i does not lie in 
the span of by linear independence. From the properties of P 

we know that u m+ 1 is orthogonal to Mi,..., u m , and hence Mi,..., w m+ i is a 
collection of nonzero orthogonal vectors. It is not hard to check that the span 
of Mi,..., w m+ i is equal to the span of Mi,..., u m+ i, using the corresponding 
statement for m and the definition of u m+ \. This completes the proof of the 
lemma. 


Corollary 1.53 Every nonzero subspace ofV admits an orthogonal basis. 


(One could say that the subspace {0} has the empty orthogonal basis.) 
Corollary |1.53| follows from Lemma |1.52|, by starting with any basis for 
the subspace. 


Corollary 1.54 IfU is a vector subspace ofV, then there is a unique linear 
operator P = Pu on V (the orthogonal projection of V onto U) such that 
P(v) lies in U for all v in V, v — P(v) is orthogonal to all elements ofU for 
any v in V, and P{w ) = w for all w in U. 

This follows from the earlier discussion, since we know from the previous 
corollary that U has an orthogonal basis. 

If U is a vector subspace of V, then the orthogonal complement U L of U 
in V is the vector subspace of V defined by 

(1.55) U ± — {v G V : (v,w) = 0 for all w G U}. 

Notice that 

(1.56) unu ± = {o}. 

We can reformulate the characterizing properties of the orthogonal projection 
Pjj of V onto U as saying that for each vector v in V, Pu(v) lies in U and 
v — Pjj{v ) lies in C ± . Thus V is the span of U and t/ ± . Also, U L is exactly 
the same as the kernel of Pjj. 
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Let us check that 

(1.57) (f/ ± ) ± = U. 

for any subspace U of V. The inclusion U C ( U ± ) _L is a simple consequence of 
the definition, and so it is enough to show that (f/" L ) _L C U. Let w G ( U “ L )“ L 
be given. We know that Pu(w) G U C (f/- 1 )- 1 -, and hence w — Pu(w) G (t/" 1 )" 1 
as well. On the other hand, w — Puiw ) lies in U L . Hence w — Puiw) lies 
in both U L and (TA 1 )- 1 , and is therefore 0. This shows that w = Puiw) is 
contained in U, as desired. 

Notice that 

(1.58) iV = I - Pu, 

since I — Pu satishes the properties that characterize Pu±. Also, if U\, U 2 
are orthogonal subspaces of V, then U\ + U 2 = V if and only if U 2 = U [*", 
which is equivalent to U\ = U 2 . 

Suppose that T is a linear operator on V. The adjoint of T is the unique 
linear operator T* on V such that 

(1.59) (T(v),w) = (v,T*(w)) 

for all v,w G V. It is not hard to describe the matrix of T* with respect to 
an orthogonal matrix in terms of the matrix of T with respect to the same 
basis. 

The adjoint of the identity operator I on V is itself. If S and T are linear 
operators on V, then 

(1.60) (T*)* = T, 

(1.61) (S + T)* = S* + T*, 

and 

(1.62) (SoT)* = T*oS*. 

If a is an element of the scalar field k, then 

(1.63) (a T)* = aT*. 

A linear operator S on V is said to be self-adjoint if S* = S. Sometimes 
one says instead that S is symmetric. As a basic class of examples, if U is 
a subspace of V and Pu '■ V —> V is the orthogonal projection of V onto U, 
then Pu is self-adjoint. More precisely, if v\ and v 2 are arbitrary vectors in 
V, then 

(1.64) ( Pu(v 1 ),v 2 ) = (Pu(vi),Pu(v 2 )) = (v 1? Pu(v 2 )), 
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since Pu(v ) E U and v — Pu(v ) € U L for v E V. 

A linear operator A on V is said to be antiself-adjoint, or antisymmetric , 
if A* = —A. Any linear operator T on V can be written as the sum of 
a self-adjoint operator S and an antiself-adjoint operator A by taking S = 
(T + T*)/2 and A — (T — T*)/ 2. 

Suppose that R is a linear operator on V such that 

(1.65) (R( v ), R(w)) = (v, w) 

for all v, w E V. This is equivalent to saying that R is invertible and 

(1.66) R - 1 = R*. 

One often says that R is an an orthogonal transformation or a unitary trans¬ 
formation. 

The notion of adjoints can be extended to mappings between two inner 
product spaces. More precisely, let (Vi, (*, -)i) and (Vj, (•, -) 2 ) be two inner 
product spaces, with the same symmetric held k of scalars, and let T be a 
linear mapping from Vj to Vj. The adjoint of T is the unique linear mapping 
T* : V 2 —> Vj such that 

(1.67) (T(u), w) 2 — (u, T*(w))i 

for all u E Vj and w G V 2 . The adjoint can be described easily in terms 
of orthogonal bases and matrices again, although now one would use an 
orthogonal basis in each of Vj and V 2 . 

As before, (T*)* = T, (R + T)* = R* + T*, and ( aT )* = aT* for any 
two linear operators R,T : V\ —> V 2 and any scalar a G k. If (V 3 , (•, -) 3 ) is 
another inner product space with the same scalar held k , and if S : V 2 —> V 3 
is linear, so that S o T : Vi —> V s is defined, then (S o T)* = T* o S*, as linear 
mappings from V\ to V 3 . 

Note that a linear mapping T : Vj —> V 2 preserves the inner products on 
Vj and V 2 , in the sense that 

(1.68) (T(u),T(v)) 2 = (u,v) 1 

for all u, v E Vj, if and only if T* o T is the identity operator on Vj. In this 
case T is one-to-one, but it may not map Vj onto Vj. If it does, so that T 
is invertible, then we can simply say that T preserves the inner products if 

nn* rp— 1 
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1.8 Inner products and representations 

Let G be a finite group, let V be a vector space over a symmetric field k, and 
let p be a representation of G on V. If (-, •) is an inner product on V, then 
(•, •) is said to be invariant under the representation p, or simply p-invariant, 
if every p x : V —■► V, x in G, preserves the inner product, i.e., if 

(1.69) (p x (v),p x (w)) = (v,w) 
for all x in G and v, w in V. 

If (•, -)o is any inner product on V, then we can obtain an invariant inner 
product (•, •) from it by setting 

(1.70) (v, w) = J2 {py(v),p y (w)) 0 . 

y&G 

It is easy to check that this does define an inner product on V which is 
invariant under the representation p. Notice that the positivity condition 
for (-, •) is implied by the positivity condition for (-, -) 0 , which prevents (•, •) 
from reducing to 0 in particular. 

In some situations it is easy to write down an invariant inner product for 
a representation directly. In the case of the left and right regular represen¬ 
tations for a group G over the symmetric field k , as in Subsection |L3| , one 
can use the inner product 

(1-71) (fi, f 2 ) = 51 /i( x ) M x )- 

x£G 

More generally, for a permutation representation of G relative to a nonempty 
finite set E, as in Subsection [L3|, one can use the inner product 

(1.72) (/,./i) = E/iW® 

a£E 

This inner product is invariant under the permutation representation, be¬ 
cause the permutations simply rearrange the terms in the sums without af¬ 
fecting the sum as a whole. 

Let (•, •) be any inner product on V which is invariant under the repre¬ 
sentation p. Suppose that IT is a subspace of V which is invariant under p, 
so that 

(1.73) p x (W) = IT 
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for all x in G. Let W 1 be the orthogonal complement of W in V with respect 
to this inner product (•,•), as in (|1.55|) . Then 


(1.74) 


p x {W L ) = W J 


for all x in G, since the inner product is invariant under p. This gives another 
approach to finding an invariant complement to an invariant subspace, as in 
Subsection p~5| . 

One can repeat this to get invariant subspaces on which p restricts to be 
irreducible, as in Subsection |1 . 5| . The next lemma is the analogue of Lemma 
1.431 in this situation. 


Lemma 1.75 Suppose that G is a finite group, V is a vector space over a 
symmetric field k, and p is a representation of G on V. Assume also that 
(•, •) is an inner product on V which is invariant under p. Then there are 
orthogonal nonzero subspaces W \,..., IT), ofV such that span(lT 1 ,..., Wfi) = 
V, each Wj is invariant under p, and the restriction of p to each Wj is an 
irreducible representation of G. 


2 Algebras of linear operators 

2.1 Basic notions 

Let V be a vector space over a held k, and recall that C{V) denotes the 
collection of linear operators on V. 

Definition 2.1 A subset A of jC(V) is said to be an algebra of linear op¬ 
erators on V if A is a vector subspace of £( V) which contains the identity 
transformation on V and which contains S o T whenever S,T £ A. 

Thus, for example, £( V) is an algebra of linear operators on V, as is the 
set of scalar multiples of the identity transformation on V. 

Definition 2.2 If A is a subset of C{V), then the commutant of A is defined 
to be the set of operators T £ C(V) such that T o S = S o T for all S £ A. 
The commutant of A is denoted A 1 . 
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For example, C{V)' is the set of scalar multiples of the identity, and the 
commutant of the set of scalar multiples of the identity is C(V). 

If A is any subset of C{V ), then the algebra of operators on V generated 
by A is the algebra consisting of linear combinations of the identity operator 
and finite products of elements of A. It is easy to see that the commutant of 
A is the same as the commutant of the algebra of operators on V generated 
by A. The commutant of any subset A of C(V) is an algebra of operators 
on V. 

The double commutant of a set A C C(V) is simply the commutant of 
the commutant of A, {A!)' . For simplicity we also write this as A" . By 
definition, we have that 
(2.3) A C A". 

We shall be interested in conditions which imply that A = A". Of course it 
is necessary that A be an algebra of operators on V for this to hold. 

Let us note the following. 

Lemma 2.4 If A is any set of operators on V, then A'" = A' (where A" 1 = 
(A")' denotes the triple commutant of A). 

As in (|2.3|) , A C A'". Conversely, if T lies in A then T commutes with 
all elements of A", and hence T commutes with all elements of A. because 
A C A". Thus T G A. 

The next general fact about algebras of operators will also be useful. 

Proposition 2.5 Let V be a vector space over the field k, and let A be an 
algebra of operators on V. Suppose that T lies in A, and that T is invertible 
as a linear operator on V. Then T~ l also lies in A. 

If T is any linear operator on V and P(z) = Yfj=o a j A is a polynomial 
with coefficients in k, then we write P(T) for the linear operator ctj TT 
As usual, z° is interpreted here as being 1, and T° is interpreted as being the 
identity operator on V. 

Because C(V) has finite dimension as a vector space over V, there is a 
nontrivial polynomial P with coefficients in k such that P(T ) = 0. In other 
words, the operators T J , j = 0,..., n, cannot be linearly independent in C(V) 
when n is large enough (e.g., if n is equal to the square of the dimension of 
V), and a linear relation among them leads to a polynomial P such that 
P(T ) = 0. 
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If T is invertible, then we may assume that the constant term in P is 
nonzero. That is, if the constant term were 0, then we can write P(z) as 
z Pi(z), where P\(z) is another polynomial (with degree 1 less than the degree 
of P). In this case we have that T Pi(T) = 0, and hence P\ (T) = 0, since 
T is invertible. By repeating this process as necessary, we can reduce to the 
case where the constant term is not 0 . 

Thus we assume that P(0) 7 ^ 0. Now, P(z) — P{ 0) is a polynomial whose 
constant term vanishes, and so we may write it as zQi(z), where Qi(z) is a 
polynomial of degree 1 less than that of P. We obtain that T Qi(T) = P( 0) I, 
which is the same as saying that T _1 = P(O )^ 1 Qi(T). Thus T -1 can be 
expressed as a polynomial in T, and the proposition follows. 

For the original polynomial P(z) one can in fact take P(z) = det (T — z I), 
because P(T ) = 0 in this event by the Cayley-Hamilton theorem. The 
constant term P(0) = det T is already nonzero when T is invertible. 

2.2 Nice algebras of operators 

Let V be a vector space over a field k again. 

Definition 2.6 Let A be an algebra of operators on V. A vector subspace 
W ofV is invariant under A ifT(W ) C W for all T in A. 

Definition 2.7 An algebra A of operators on V is said to be a nice algebra 
of operators if every vector subspace W of V which is invariant under A is 
also invariant under A". 

Note that every vector subspace W of V which is invariant under A" 
is automatically invariant under A, since A C A". Thus one can rephrase 
Definition p.7| as saying that an algebra of operators A on V is a nice algebra 
of operators if A and A" have the same invariant subspaces. 

Lemma 2.8 Let S and T be linear operators on V, and suppose that SoT = 
T o S. If W denotes the image of S and Z denotes the kernel of S, then 
T(W) C W and T{Z) C Z. 

This is easy to check. 

Lemma 2.9 Let A be an algebra of linear operators on V. If S lies in the 
commutant A' of A, then the image and kernel of S are invariant subspaces 
of the double commutant A". 
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This follows from Lemma |2.8| . 

Thus, to show that an algebra is nice, one can try to show that all of 
its invariant subspaces come from linear operators in the commutant in this 
way. 

Lemma 2.10 Let A be an algebra of operators on V. Suppose that U\, U\ 
are two vector subspaces ofV which are complementary, so that UiC\U 2 = {0} 
and U\ + U 2 — V. Let P : V —> V denote the projection of V onto U\ along 
U 2 , be., P[ui + u 2 ) = U\ for all U\ G Ui and u 2 G U 2 . Then P lies in A 1 if 
and only if Ui and U 2 are invariant subspaces of A. In this case U\ and U 2 
are also invariant subspaces of the double commutant A". 


Assume first that U\ and U 2 are invariant subspaces of A. Thus if T lies 
in A, Ui lies in IJ\, and u 2 lies in U 2 , then T{u\) is also an element of U\ and 
T{u 2 ) is an element of U 2 , and hence 

(2.11) P(T( Ul + u 2 )) = P(T( Ul ) + T(u 2 )) = T{uf) = T(P( Ul + u 2 )). 


This shows that T commutes with P. We conclude that P lies in A' , since 
this applies to any T in A. 

Conversely, if P lies in A' , then U\ and U 2 are invariant subspaces for A", 
and hence for A, because of Lemma 2.9. More precisely, this uses the fact 


that Ui is the image of P and U 2 is the kernel of P, by construction. 


2.3 ^-Algebras 

In this subsection we assume that V is a vector space over a symmetric field 
k and that V is equipped with an inner product (■,•). 

Definition 2.12 A subset A of C(V) is said to be a ^-algebra of linear 
operators on V if it is an algebra and if the adjoint T* of an operator T lies 
in A whenever T does. 

The algebra C(V) and the set of scalar multiples of the identity are *- 
algebras. 

Lemma 2.13 If A is a *-algebra of operators on V, then the commutant AI 
of A is a *-algebra of operators on V as well. 

This is easy to check. 
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Lemma 2.14 If A is a *-algebra of linear operators on V, and if W is a 
vector subspace of V which is invariant under A, then W ± is also invariant 
under A. 


The hypothesis that W be invariant under A can be rewritten as 

(2.15) {T(w),u) = 0 

for all w G W, u G B /J_ , and T G A. This implies in turn that 

(2.16) (w,T*(u)) = 0 

for all w G W, u G W 1 , and T G A. In other words, T*{W L ) C W 1 for all 
T G A. Because A is assumed to be a *-algebra, this is equivalent to saying 
that S'(IT) C W for all S G A, as desired. 

Corollary 2.17 If A is a *-algebra of operators on V, then A is a nice 
algebra of operators on V. 


This follows easily from Lemmas [2. 1 4| and |2 .1 ( . 


2.4 Algebras from group representations 

Let G be a finite group, V be a vector space over a field k, and p a repre¬ 
sentation of G on V. Consider the set A of linear operators on V defined 
by 

(2.18) A = span{p x : x G G}. 

Here “span” means the ordinary linear span inside the vector space £(V). 
Thus 

(2.19) the dimension of A as a vector space is less 
than or equal to the number of elements in G. 

On the other hand, 

(2.20) A is an algebra of linear operators on V. 

This is easy to check, using the fact that p is a representation of G on V (so 
that p x o p y = p xy for all x and y in G, and hence p x o p y lies in A for all x 
and y in G ). 
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Lemma 2.21 Under the conditions just described, a vector subspace W of 
V is invariant under A if and only if it is invariant under the representation 
p (as defined in Subsection 

This is easy to verify. 

Lemma 2.22 Under the conditions above, if k has characteristic 0, or if k 
has positive characteristic and the number of elements of G is not divisible 
by the characteristic of k, then A is a nice algebra of operators. 


This uses the fact that each subspace of V which is invariant under the 
representation p has an invariant complement, as in Subsection |1.5| , and 
Lemma [2.10. 


Suppose now that k is a symmetric field, (■, •} is an inner product on V, 
and that (•, •) is invariant under the representation p, as in Subsection [T~E . 
This is the same as saying that 

(2.23) G^)- 1 = (p x y 

for each x in G, as in ( |1.66| ) in Subsection |1.7| . We can rewrite (|2.23| ) as 

(2.24) { Px )* = p x -1 . 

It follows easily that A is a ^-algebra in this case, since the adjoint of each 
of the pA s lies in A, by Q2.24|) . 

In other words, 


(2.25) 


the correspondence T 
“linearizing” the correspondence x i—► x^ 1 on G. 


T* has the effect of 
x~ 


Note that because the inner product is assumed to be invariant under 
p, one gets immediately that the orthogonal complement of an invariant 
subspace is also invariant, as in Subsection |1.8| . This also follows from the 
general result for *-algebras in Lemma [2.14. 


2.5 Regular representations 

Let G be a finite group and k be a field. Define V to be the vector space of k- 
valued functions on G, and consider the left regular representation L x , ieG, 
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and right regular representation R x , x G G, of G on V, as in Subsection [T~0 . 
Let Al and Ar denote the algebras of operators on V generated by the 
linear operators in the left and right regular representations, respectively. 
We would like to show that 


(2.26) 


A' l = Ar and Ar = Al- 


Once we do this, it follows that A" = Al and Ar = Ar. 

The inclusions Ar C A' l and Al C A' R follow from (|1.17|) in Subsection 
[Hf Conversely, suppose that S is an operator on V that lies in A' L . This 
condition is equivalent to 

(2.27) SoL x = L x oS 

for all x in G. We would like to show that S can be written as a linear 
combination of the R y 's for y in G. For each w in G, let (j) w (z) be the 
function on G defined by (j) w {z) = 1 when z = w and (f) w {z) = 0 when z/w, 
as in Subsection [L3| . Thus the 0^’s form a basis for V, and L x (cj ) w ) = (j) xw 
for all x, w in G (which was mentioned before in (|1.16|) ). Using (|2.27|) we 
obtain that 

(2.28) S(<f> x ) = S(L x ((f> e )) = L x (S(0 e )) 

for all x in G. Of course S(<j) e ) is a function on G, which we can write as 

(2.29) S(<j) e )= 

w&G 


where in fact c w = S(4> e )(w) for all w in G. This and (|2.28|) permit us to 
write S((j) x ) as 


(2.30) 


^(^a;) ^ ) C-w 4>xw ^ ) Cw -^w -1 (0a;), 

w£G w£G 


since R u (4 > x ) = < j > xu - 1 , as in ( |1.16|) . Thus 
(2.31) 5= Y, c w Rw-h 

weG 

which is what we wanted. In the same way one can show that if an operator 
commutes with R y for all y in G, then the operator is a linear combination of 
LA s. Therefore A' L C Ar and A' R C Al- This completes the proof of (j2.26p. 
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2.6 Expanding an algebra 

Let V be a vector space over a field k, and let A be an algebra of linear 
operators on V. Fix a positive integer n, and let V n denote the vector space 
of functions / from {1,2 ,... , n} to V. In effect, V n is a direct sum of n 
copies of V, and this is a convenient way to express it. 

If T is a linear operator on V, then we can associate to it a linear operator 
T on V n by 

(2.32) T(f)(j)=T(fU)) 

for each / e V n and j — 1,2,.. ,,n. In other words, T is the same as T 
in the V directions in V n , and does nothing in the other directions. If one 
thinks of V n as a direct sum of n copies of V, then T acts in the same way 
as T on each of these copies, with no mixing between the copies. 

It is not hard to verify that the correspondence T t—> T from C{V) to 
C{V n ) is linear and cooperates with compositions, so that S oT = S o T 
for all S, T G C(V). Also, if T is the identity operator on V, then T is the 
identity operator on V n . Set 

(2.33) An = {f:Te A}. 

From the preceding remarks it follows that A„ is an algebra of operators on 
V n , since A is an algebra of operators on V. 

Proposition 2.34 (A") n = {An)". 

Here {A") n = {T : T G A in analogy with A n , while ( A n )" denotes 
the double commutant of A n as an algebra of operators in C(V n ). Thus the 
proposition states that one gets the same result whether one first takes the 
double commutant of A, and then uses the correspondence T i —* T to pass 
to £(H n ), or one first uses this correspondence to pass to £(V n ), and then 
takes the double commutant there. 

To prove the proposition, let us look first at what {An)' is. For £ = 
1,2,..., n, let Pi denote the natural projection of V n onto the fill copy of V 
inside V n , defined by 

(2-35) Pe{f){j) = f(j) when j — 

= 0 when j ^ t. 
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Thus Pf, o P f — () when h ^ l, and 

n 

(2.36) = identity transformation on V n . 

£=1 

For any T E £(V), 

(2.37) " ToP e = P t of, 

by the way that T is defined. Now let M be any linear transformation on 
V n , and set M h / — P h o M o P e . To say that M lies in ( A n )' means that 

(2.38) M o f = f o M 
for all T E A. ffence M E ( A n )' if and only if 

(2.39) Mh./ ° T = T o Mh/ 

for all T E A and h, £ — 1,2 ,, n, because of the preceding observations. 
For £ — 1, 2,..., n, let : V —> V n denote the linear mapping defined by 

(2.40) = v when j — £ 

= 0 when j ^ i. 

In other words, 0£ is the obvious identitification between V and the 7th 
copy of V inside V n . For each linear transformation M on V n and each 
h, £ = 1, 2,..., n, let M h ( , be the linear transformation on V such that 

(2.41) Mh,e ° = 9h° Mh/■ 


By construction, Mh/ essentially corresponds to a linear mapping from the 
7th copy of V in V n to the hth copy of V in V n , and with M h j we rewrite 
this as a mapping on V in the obvious way. 

We also have that 

(2.42) fo6 t = 6 l oT 


for all T E C{V) and £ = 1,2,... ,n, again just by the way that everything 
is defined here. The bottom line is that (|2.39|) is equivalent to 


(2.43) 


Mh/ o T = T o Mh/ 


for all T E A and h, £ = 1, 2,..., n. To summarize, M E C(V n ) lies in ( A n )' 
if and only if the “blocks” Mh/, 1 < h, £ < n, in C(V) all lie in A 1 . 
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Now suppose that S G C{V) lies in A". Thus S commutes with all linear 
operators on V in A !. Using this, it is not hard to show that 


(2.44) 


MoS = SoM 


for all M G C(V n ) which lie in (A n )'. To be more precise, (|2.44j) holds if and 
only if 

(2.45) M hA oS = So M h , e 

for all h, £ — 1,2,... ,n, as before. This is in turn equivalent to 


(2.46) 


M h/ oS = So M m 


for h,t = 1,2,... ,7i, as operators now on V instead of V n . This condition 
holds for S G A' because M h j lies in A for h, £ — 1, 2,..., n. 

From (|2.44| ) we obtain the inclusion 

(2.47) (A') n C {An)". 

Next we wish to show that 


(2.48) {An)" C (^'On- 

Let M G C(V n ) be any element of (A n )". The first point is that each 
of the n copies of V inside V n are invariant under M , which is the same as 
saying that 

(2.49) M h / = 0 when h ^ i. 

This follows from the fact that each Pi lies in {An)', so that M commutes 
with each Pp. 

If 7r is any permutation on {1, 2 ,,n}, consider the linear transformation 
on V n which takes f{j) in V n to / (7r(j)). This transformation commutes with 
T on V n for any T G C{V), and thus this transformation lies in {A n )'- 

As a result, M in (A n )" commutes with all of these linear transformations 
on C(V n ) obtained from permutations on {1,2 ,... ,n}. Using this it is not 
hard to verify that M = R for some linear operator R on V. 

The remaining observation is that R lies in A". Indeed, if S is any linear 
operator on V which lies in A, then it is easy to see that S lies in (An)'- This 
implies that M commutes with S, and hence R commutes with S. Therefore 
R G A', since this applies to any S in A . In other words, M — R lies in 
(A") n , as desired. This completes the proof of Proposition [2.34 . 
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Suppose now that k is a symmetric field and that V is equipped with an 
inner product (■,•). We define an inner product (•, -) n on V n by 

n 

( 2 -50) (f,h) n = J2(fti)i h ti)) 

3 = 1 

for all /, h £ V n . In this way V n is an orthogonal direct sum of n copies of 
V, all with the same inner product as on V. 

Lemma 2.51 For each linear operator T on V, (T*) = (T)* (where T* is 
defined in terms of the inner product on V, while (T)* is defined using the 
inner product on V n ). 

This is easy to check. 

Corollary 2.52 Under the conditions above, if A is a *-algebra of operators 
on V, then A n is a *-algebra of operators on V n , and hence is nice. 

Now assume that k is a field, V is a vector space over k, and A is the 
algebra of operators on V generated by a representation p of a finite group 
G on V. We can associate to p a representation p n of G on V n in a simple 
way, namely by setting 
(2.53) ^ //; fi, 

for each x in G. 


Lemma 2.54 Under the conditions just described, A n is the algebra of op¬ 
erators generated by the representation p n of G on V n . 

This follows easily from the definitions. 


Corollary 2.55 If we also assume that either k has characteristic 0, or k 
has positive characteristic and the number of elements of the group G is not 
divisible by the characteristic of k, then A n is nice. 


This uses Lemma 2.22 in Subsection 2.4 
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2.7 Very nice algebras of operators 

Definition 2.56 Let k be a field and V a vector space over k. An algebra of 
operators A on V is said to be very nice if A n C C(V n ) (defined in Subsection 
fAtf ) is a nice algebra of operators for each positive integer n. 


The n — 1 case corresponds to nice algebras of operators. From Corol¬ 
laries 2.52 and 2.55 we know that A is very nice if either it is a *-algebra, 
assuming that k is a symmetric field and that V is equipped with an in¬ 
ner product, or if A is generated by a representation of a finite group on V 
and either the characteristic of k is 0 or the number of elements of G is not 
divisible by the characteristic of k when it is positive. 


Theorem 2.57 Let k be a field, V a vector space over k, and A an algebra 
of operators on V. If A is very nice, then A" = A. 


This is a relative of the “double commutant theorem”, and for the proof 
(including some of the preliminary steps in the previous subsections) we are 
essentially following the argument indicated on pi 18 of |[Doii|| . As in [ |Doii|| , 
this argument applies to “Von Neumann algebras” (with suitable adjust¬ 
ments) . 

Let k, V, and A be as in the statement of the theorem. As in Q2.3| ) in 
Subsection |2d], A C A" holds automatically, and so we need only show that 


A" C A. 

Let us first check that for each v G V and S G A" there is a T E A such 
that S(v) = T(v), assuming only that A is nice. Fix v G V, and set 


(2.58) 


W = {T(v) :T eA}. 


It is easy to see that W is a vector subspace of V which is invariant under A, 
because A is an algebra of operators. If A is nice, then W is also invariant 
under A". Of course v G W, since I G A, and therefore S(v) G W for any 
S G A", because W is invariant under A". This shows that for each S G A " 
there is a T G A such that T(v) = S(v), by the definition of W. 

Now let n be a positive integer, and let V n , A n , etc., be as in Subsection 
P~0| . The preceding observation can be applied to V n and A n (assuming that 
A is very nice, so that A n is nice) to obtain that for each / G V n and each 
M G (An)" there is an N G A n such that N(f) = M(f). 

From Proposition [2.34| we know that ( A n )'' = (A") n . Thus the previous 
assertion can be rephrased as saying that for each / G V n and each M G 
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(A!') n there is an N G A n such that N(f) = M(f). This can be rephrased 
again as saying that for each / G V n and each S G A" there is a T G A such 
that 

(2.59) T(f) — S(f). 

This is in turn equivalent to the following statement. If tq, v 2 ,..., v n are 
elements of V, and if S G A", then there is a T G A such that T{vf) = S(vj ) 
for J = 1,2,.. . ,n. 

If n is the dimension of V, then we can choose v±,V 2 , ■ ■ ■, v n to be a basis 
of V. In this case the preceding statement reduces to saying that if S G A", 
then there is a T G A such that T = S. This implies that A" C A , and 
hence A!' = A, since the other inclusion is automatic. This completes the 
proof of the theorem. 


2.8 Some other approaches 

Let k be a held, V a vector space over k, and A an algebra of operators on 
V. Since A C A!' automatically, another way to try to show that A = A!' is 
to show that the dimension of A is equal to the dimension of A", as vector 
spaces over k. Of course this is easier to do if A" is known and reasonably 
simple. 

If A! consists of only scalar multiples of the identity operator on V, then 
A" = jC(V), and the dimension of A" is equal to the square of the dimension 
of V. In this case we can just say that A = C(V) if A has dimension equal 
to the square of the dimension of V. Let us give a couple of other criteria 
for A to be equal to C(V). 


Lemma 2.60 Let A be an algebra of linear operators on V which satisfies 
the following two conditions: (a) for any vectors v, w in V with v 0 there 
is a linear operator T in A such that T(v) = w; (b) there is a nonzero linear 
operator on V with rank 1 which lies in A. Then A = C{V). 


Condition (a) in the hypothesis is equivalent to asking that A be irre¬ 
ducible in the sense of Definition WX. 


To show that A = £(D), it is enough to show that every linear operator 
on V with rank 1 lies in A. By assumption, there is a nonzero operator R 
of rank 1 which lies in A, and we can write R as R(u) = f(u)z, where z 
is a nonzero element of V, and / is a nonzero linear mapping from V to 
k, i.e., a nonzero linear functional on V. If w is any other nonzero vector 
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in v, then there is an operator T in A such that T(z) = w, and hence 
(T o R)(u) = f(u ) T(z) = f(u) w and T o R lies in A. 

Let h be any nonzero linear functional on V. We would like to show that 
there is a linear operator S in A such that h = / o S. If we can do that, then 
(T o R o S){u) = h(u) w and T o R o S lies in A, and we get that any linear 
operator on V of rank 1 lies in A. 

Consider the set 


(2.61) 


Z = {y E V : f o S(y) — 0 for all S G A}. 


The irreducibility assumption for A implies that Z contains only the zero 
vector in V. On the other hand, the set of linear functionals on V of the 
form / o S is a vector subspace of the vector space of all linear functionals 
on V. The set Z is exactly the intersection of the kernels of these linear 
functionals, and if they did not span the whole space of linear functionals on 
V, then Z would contain a nonzero vector. This is a basic result of linear 
algebra. Since Z does contain only the zero vector, we obtain that the set 
of linear functionals on V of the form / o S , S G A , is equal to the set of all 


linear functionals on V, as desired. This completes the proof of Lemma 2.6C 


Lemma 2.62 Let A be an algebra of linear operators on V such that for 
any pair v\, v 2 of linearly independent vectors in V and any arbitrary pair 
of vectors W\, w 2 in V there is a T in A such that T(v i) = Wi, T(v 2 ) = w 2 . 
Then A = C{V). 


We may as well assume that V has dimension at least 2 , since the conclu¬ 
sion of the lemma is automatic when the dimension of Id is 1. The hypothesis 
of the lemma is also vacuous in this case. Because of Lemma |2.60| , it suffices 
to show that A contains a nonzero operator of rank 1 . 

Let R be any nonzero operator in A. If R has rank 1 , then we are 
finished, and so we assume that R has rank at least 2. This means that there 
are vectors U\, u 2 in V such that R(u±), R(u 2 ) are linearly independent. 
By assumption, there is an operator T in A such that T(R(ui)) = 0 and 
T(R(u 2 )) 7 ^ 0. Thus T o R is nonzero and has rank smaller than the rank 
of R , because the dimension of the image has been reduced, or, equivalently, 
the dimension of the kernel has been increased. We also have that T o R is 
in A, since R and T are elements of A. By repeating this process, as needed, 
we can get a nonzero element of A of rank 1 . 
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Lemma 2.63 Suppose that A is an algebra of linear operators on V such 
that A! consists of only the scalar multiples of the identity and A n C C{V n ) 
(defined in Subsection \ 2. dj ) is a nice algebra of operators when n — 2. Then 

a = av) = A". 

The assumptions in this lemma imply the hypothesis of Lemma |2.62 . 


3 Vector spaces with definite scalar product 

Let ^ be a field and let V be a vector space. Suppose that (v, w ) is a 
definite scalar product on V, by which we mean the following: (a) ( v , w ) 
is a bilinear form on V, i.e., it is a function from V x V into k such that 
v i—► (u, w) is linear for each w in V, and w t—> (v, w) is linear for each v 
in V; (b) (v,w) is symmetric in v and w, so that (v,w) = (w,v) for all 
v and w in V; and (c) (v, v) = 0 if and only if v = 0. This last can be a 
somewhat complicated condition, which does not necessarily entail positivity. 
See [|Ser3|| , for instance. For finite fields any homogeneous polynomial of 
degree 2 in at least 3 variables has a nontrivial solution, as on p338 of ||Cas| 
and Corollary 2 on p6 of ||Ser3 


Two vectors v, w in V are said to be orthogonal if (v, w) — 0. A collection 
of vectors V \,..., ry is said to be orthogonal if ( Vj,vi ) = 0 for all j, l such 
that 1 < j, l < k and j ^ l. As usual, nonzero orthogonal vectors are linearly 
independent. 

If W is a linear subspace of V, then we define the orthogonal complement 
U 1 - of U by 

(3.1) y = {vev : (v, u) = 0 for all u G U}. 

Thus U 1 - is also a linear subspace of V, and 


(3.2) 


unu 1 = {o}, 


since our scalar product is assumed to be definite. 

Suppose that u\,... ,u n is a collection of nonzero orthogonal vectors in 
V, and let U denote the span of u\, ..., u n . Define a linear operator P on V 
by 


(3.3) 


n 


p(v) = E 

3 = 1 


(v,Uj) 
7 - 

( Uj , Uj) 
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We have that P(u) = u when u G U, P(v) = 0 when v G U L , and 

(3.4) P(w) EU, w- P(w) G U L for all w G V. 

For any w in V, there is at most one vector y in V such that y G U and 
w — y£ U- 1 , because if y' is another such vector, then y — y'^U flf/A Hence 
P is characterized by (p.4|) , and depends only on U, and not the choice of 
Ui,... ,u n . We say that P is the orthogonal projection of V onto U. 


Lemma 3.5 If x i,..., x n are linearly independent vectors in V, then there 
are nonzero orthogonal vectors ui,...,u n in V such that span(aq,..., x n ) = 
span(tti, ...,u n ). 

As usual, this can be proved using induction on n. The orthogonaliza- 
tion of the last vector is obtained with the help of an orthogonal projection 
onto the span of the previous vectors, where the orthogonal projection just 
mentioned is derived from the induction hypothesis. 


Corollary 3.6 IfU is any linear subspace ofV, then there is an orthogonal 
projection of V onto U. In particular, V is spanned by U and U L , and 
(C/ ± ) ± = U. 


Now let T be a linear operator on V. The transpose of T is the linear 
operator T l on V such that 

(3.7) (T(v),w) — (v,T t (w)) 


for all v, w in V. It is not difficult to establish the existence of such an 
operator T 4 , and uniqueness is an easy consequence of fl3.7|) . 

Note that 

(3.8) (aS + pT) = aS t + f3T l 


and 

(3.9) (S o Tf = T t o S t 


for any scalars a, (3 and linear operators S, T on V. The transpose of the 
identity operator / on V is itself, and 


(3.10) 


( T< )t = T 


for any linear operator T on V. 
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Definition 3.11 Let A be an algebra of linear operators on V, as in Defi- 
We say that A is a t-algebra ifT 1 lies in A whenever T does. 


nition 2.1 


Of course the algebra C(V) of all operators on V is a t-algebra, as is the 
algebra consisting only of scalar multiples of the identity. 


Lemma 3.12 If A is a t-algebra of linear operators on V, then the commu- 
tant A! is also a t-algebra of linear operators on V. 


This is a simple exercise. 

Lemma 3.13 Let A be an algebra of operators on V, and let U be a linear 
subspace of V which is invariant under A. If A is a t-algebra, then U 1 - is 
invariant under A, and the orthogonal projection Pu ofV onto U lies in A'. 


The statement that U is invariant under A is equivalent to 


(3.14) (T(u),v) = 0 

for all u G U, v G U x , and T G A. This implies that 


(3.15) 


{u,T\v)) = 0 


for all u G U, v G U 1 - , and T G A, so that U 1 - is invariant under T l for all T 
in A. If A is a t-algebra, then it follows that U 1 is invariant under A. The 
information that U and U L are both invariant under A implies that Pjj G A !, 


as in Lemma 2.10 


As a result, if A is a t-algebra of operators on V, then A is a nice algebra 
of operators, in the sense of Definition [2.7| . 

Let n be a positive integer, and define V n , A n as in Subsection [T6|. Fix 
nonzero elements Ai,..., A n in k, and consider 


(3.16) 


(M)» = £V(/(j),fc(j)), Mer 


3 = 1 


In general, this will not be definite, but if it is, then it is easy to see that A n 
is a nice algebra of operators on V n , because it is a t-algebra with respect to 
(/, h) n , as in Corollary [2.52 . 
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4 Irreducibility 

Definition 4.1 Let V be a vector space over a field k. An algebra of op¬ 
erators A on V is said to be irreducible if there are no vector subspaces W 
of V which are invariant under A (Definition WS\ ) except for W = {0} and 
W = V. 


Lemma 4.2 An algebra of operators A on a vector space V over a field k is 
irreducible if and only if for every pair of vectors v, w in V with v 0 there 
is an operator T in A such that T(v) = w. 

The “if” part is a simple consequence of the definition. For the “only if” 
part, let v be a nonzero vector in V, and consider the set of vectors in V of 
the form T(v), T E A. This set of vectors is a vector subspace of V which 
is invariant under A, because A is an algebra of operators, and it is nonzero 
because the identity operator lies in A, so that v lies in the subspace. Thus 
irreducibility of A implies that this subspace is all of V, which means exactly 
that for every vector w in V there is an operator T in A such that T(v) = w. 


Lemma 4.3 IfV is a vector space, G is a finite group, p is a representation 
of G on V, and A is the algebra of operators on V generated by p x for x in 
G as in Subsection then p is an irreducible representation of G if and 
only if A is irreducible. 


This follows from Lemma 12.211 in Subsection 1274. 


Proposition 4.4 Let V be a vector space over a field k, and let A be an 
algebra of operators on V. If A is irreducible, then for each T in the conn- 
mutant A!, either T = 0 or T is invertible as an operator on V. In the latter 
event T” 1 also lies in A'. 


If T lies in A!, then the kernel and image of T are invariant subspaces 
for A, as in Lemma j2.9[ From this it is easy to see that either T is the zero 
operator or T is invertible when A is irreducible. Proposition [2.5| implies 
that T _1 lies in A! when T is invertible, since A! is an algebra of operators. 


Proposition 4.5 Let V be a vector space over an algebraically closed field 
k. If A is an irreducible algebra of operators on V, then the commutant A! 
is equal to the set of multiples of the identity operator by scalars (elements 
ofk). 
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The commutant of any algebra always contains the scalar multiples of the 
identity, since the identity operator commutes with all other linear mappings, 
and so it suffices to show that every linear transformation in A is a multiple 
of the identity. This is in essence the same as one-half of the classical Schur’s 
lemma , which can be proved as follows. 

Let T be any linear operator in A 1 . Because k is algebraically closed, 
there is at least one nontrivial eigenvalue A € k of T. In other words, there 
is at least one A in k for which the eigenspace 

(4.6) E(T, A) = {v e V : T(v) = A v} 

contains nonzero vectors. This is because A is an eigenvalue of T if and only 
if det(T — A I) =0, and det(T — A I) is a polynomial in A (of degree equal to 
the dimension of V, which is positive). 

If E(T, A) = V, then T = XI, which is what we want. Thus we suppose 
that E(T , A) is not all of V. It is not hard to check that if 5 is a linear 
operator on V which commutes with T, then 


(4.7) 


S(E(T,\))CE(T,\)- 


In particular, this holds for all S G A, since T G A! by hypothesis. Hence 
E(T , A) is an invariant subspace of A, in contradiction to the assumption 
that A is irreducible. We conclude that E[T , A) = V. 

An alternative approach is to say that if T lies in A, then T generates a 
field extension of k of finite degree (since A C C{V) has finite dimension over 
k), and that this field extension should be equal to k when k is algebraically 
closed. 

Now let us consider the case of a vector space V over the real numbers 
R. Suppose as before that A is an algebra of operators on V which is 
irreducible. One can start with Proposition [f.4| and apply a famous result of 
Frobenius (as in |[Alb|| ) to conclude that A either consists of real multiples 
of the identity, or that A is isomorphic to the complex numbers or to the 
quaternions, as algebras over the real numbers. Here we shall describe a 
different (also classical) analysis, under the additional assumption that V is 
equipped with an inner product (-, •) and that A is a *-algebra. A broader 
discussion can be found in ||( 'ur 1| . [Hai|| , which also allows for nonassociative 
algebras. 


Lemma 4.8 If T e A, then T = 7\ + T 2 , where Tf = T 1; Tf = —T 2 , and 
T u T 2 e A. 
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As usual, one can take 7\ = (T + T*)/ 2 and (T — T*)/2 to get T), T 2 so 
that T = Ti + T 2 , T* — Ti, and T 2 * = — T 2 . Because A is a *-algebra, we 
also have that T), T 2 £ A' when T & Ai. 

Lemma 4.9 If T £ A is symmetric, T* = T, then T = a I for some real 
number a. 


For this we use the fact that T is diagonalizable, since it is symmetric. 
We really only need that T has an eigenvalue, as in the setting of Proposition 
O. The eigenspace is an invariant subspace for A, for the same reason as 
before, and hence the eigenspace is all of V, by the irreducibility of V. This 
says exactly that T is a scalar multiple of the identity. 

If R and T are elements of A, then (R* T + T* R )/2 is an element of A, 
since A is a *-algebra, and it is also symmetric as an operator on V. Hence 
it is a real number times the identity operator, by Lemma |4.9| . Let (R, T) 
denote this real number, so that 


(4.10) 


(R’T + r R)/2 = ( R,T)I. 


It is easy to see that (. R , T ) is linear in R and in T, and symmetric in R and 
T. 

If R = T, then we can rewrite (|4.10|) as 

(4.11) R* R = (R, R) I. 

If v is any vector in V, then 

(4.12) (R(v),R(i 0) = {(R*R){v),v) = (R,R) (v,v). 

This implies that ( R , R) is always nonnegative, and that it is equal to 0 
exactly when R = 0. Thus (R, T ) defines an inner product on A. If ( R , R) > 
0, we also get that R is invertible. 

Next, if R,\ , R /2 £ A , then 


(4.13) 

This is because 


(Ri R 2 , R\ R 2 ) — (i?i, R\) (R 2 , R 2 ). 


(4.14) (R 1 R 2 )*(R 1 R 2 ) = R* 2 R* 1 R 1 R 2 = {R 1 ,R 1 )R*R 2 

— (Ri,Ri) {R2,R2)i- 
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Now let us focus somewhat on antisymmetric operators in A! . If R G A! 
is antisymmetric, then 

(4.15) R 2 = -(R,R)I. 

If R ,\, R -2 € A! are antisymmetric and {R ,\, R 2 ) = 0, then R\ R 2 + R 2 R\ = 0, 
which reduces to 

(4.16) R\ R 2 = —R 2 R.\ ■ 

In this case we also have that 


(4.17) {R 1 R 2 y = R* 2 Rl = R 2 R 1 = -R 1 R 2 , 

so that R\ R 2 is also antisymmetric. 

Lemma 4.18 Suppose that R\ and R 2 are antisymmetric elements of A! 
such that (Ri,R 2 ) = 0. Then 

(4.19) i?i (i?i R 2 ) = — (R\ R 2 ) Ri, R 2 (R\ R 2 ) = — (i?i R 2 ) R 2 , 
these elements of A' are antisymmetric, and 

(4.20) ( R \, R\ R 2 ) = (7?2, Ri R 2 ) = 0. 


This is not hard to check. 

Assume for the moment that 7\ and T 2 are nonzero antisymmetric ele¬ 
ments of A! such that (T 1; T 2 ) =0, and that the set of antisymmetric elements 
of A! is not spanned by T\ , T 2 , and TiT 2 . Then there is another nonzero 
antisymmetric element U of Ad such that 

(4.21) ( U , Tf) = (U, T 2 ) = (U, T, T 2 ) = 0. 


We know from the observations above that 7\ T 2 is antisymmetric, and hence 
(U, Ti T 2 ) = 0 implies that 


(4.22) 


U(T 1 T 2 ) = -(T 1 T 2 )U, 


as in (|4.16|). Similarly, 


(4.23) UT X = -T, U and UT 2 = -T 2 U, 
since (U, T x ) = (U , T 2 ) = 0. These two equations imply that 

(4.24) UiT^) = (UT l )T 2 = -(T l U)T 2 

= —T x (UT 2 ) = —T\ (—T 2 U) = (T x T 2 ) U. 
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This together with ( |4.22| ) lead to 

(4.25) U (Ti T 2 ) = 0. 


This is a contradiction, because Ti, T 2 , and U are assumed to be nonzero 
elements of A! , and hence are invertible, so that the product cannot be 0. 

The conclusion of this is that if Tf and T 2 are nonzero antisymmetric 
elements of A' , then the set of antisymmetric elements of A ' is spanned by 
T], T 2 , and T 3 . 

Thus we obtain three possibilities for A'. The first is that there are no 
nonzero antisymmetric elements of A'. In other words, all elements of A! 
are symmetric, because of Lemma 4.8. From Lemma |4.9| it follows that A! 


consists exactly of multiples of the identity operator by real numbers. 

The second possibility is that there is a nonzero antisymmetric element 
J of A', and that all other antisymmetric elements of A! are multiples of J 
by real numbers. 

The third possibility is that A' contains nonzero antisymmetric elements 
T\ , T 2 such that (7), T 2 ) = 0. In this case T 3 T 2 is also a nonzero anti¬ 
symmetric element of A', Ti T 2 is orthogonal to T\ and T 2 relative to the 
inner product (•, •) on A', and all antisymmetric elements of A! are linear 
combinations of Ti, T 2 , and T\ T 2 . 

These are the only possibilities for A '. In other words, the dimension of 
the real vector space of antisymmetric elements of A' is either 0, 1, or greater 
than or equal to 2, and these cases correspond exactly to the possibilities just 
described. In particular, if the vector space of antisymmetric elements of A! 
has dimension greater than or equal to 2, then the dimension is in fact equal 
to 3. 

In the first situation, where A! consists of only multiples of the identity, 
it follows from Theorem L57 that A is equal to the algebra £( V) of all linear 
transformations on V. Note that if A = C.{V), then A' is equal to the set of 
multiples of the identity operator by real numbers. 

In the second situation, there is a nonzero antisymmetric element J of A! 
such that all other antisymmetric elements of A! are equal to real numbers 
times J. We may as well assume that (J, J) = 1, since we can multiply J by 
a positive real number to obtain this. From ( [4.15| ) we obtain that 

(4.26) J 2 = -/. 


We also have that 

(4.27) (J(v), J(v)) = (v,v) 
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for all v G V, as in ( 4.12| ). (Notice that any two of the conditions J* = —J, 
(|4.26|) , and (|4.27|) inplies the third.) 

In general, if J is a linear operator on a real vector space that satisfies 
J 2 = —I, then J defines a complex structure on the vector space. That is, 
one can use J as a definition for multiplication by i on the vector space (and 
this choice satisfies the requirements of a complex vector space). A real linear 
operator on the vector space that commutes with J is the same as a complex- 
linear operator on the vector space, with respect to this complex structure. If 
the vector space is equipped with a (real) inner product, then the additional 
condition of antisymmetry, or equivalently ([4.271) , is a compatibility property 
of the complex structure with the inner product. 

Here we have such an operator J, and A 1 is spanned by the identity 
operator / and J. By Theorem [2.571 , A = A" is the set of linear operators 
on V which commute with J, i.e., which are complex-linear with respect to 
the complex structure defined by J. 


Conversely, one could start with a linear operator J on V such that (|4.26|) 
holds, and then define A to be the set of linear operators on V which are 
complex-linear with respect to the complex structure defined by J. It is easy 
to see that A is then an algebra. If J is compatible with the inner product 
(•, •), in the sense of (|4.27|) (or, equivalently, by being antisymmetric), then 
one can check that A is a ^-algebra. Also, A! is equal to the span of J and 
the identity operator in this case. 

The third situation is analogous to the second one, except for having a 
“quaternionic structure” on V rather than a complex structure, correspond¬ 
ing to the larger family of operators in A 1 . Let us briefly review some aspects 
of the quaternions. 

Just as one might think of a complex number as being a number of the 
form a + bi, where i 2 = — 1 and i commutes with all real numbers, one can 
think of a quaternion as being something which can be expressed in a unique 
way as 

(4.28) a + bi + cj + dk, 

where a, b, c, and d are real numbers, and i, j, and k are special quaternions 
analogous to i in the complex numbers. Specifically, i, j, and k commute 
with real numbers and satisfy 


(4.29) 


i 2 = f = k 2 = 


■1, ij = —ji = k. 
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As a result of the latter equations, one also has 

(4.30) ik = —ki, jk = —kj. 

If x — a + bi + cj + dk is a quaternion, then its conjugate x is defined by 

(4.31) x = a — bi — cj — dk, 

in analogy with complex conjugation. As in the setting of complex numbers, 
one can check that 

(4.32) xx = xx = a 2 + b 2 + c 2 + d 2 , 

where the right side is always a real number. One defines the “norm” or 
“modulus” |x| of x to be \Jxx. This is the same as the usual Euclidean 
norm of (a, b, c, d) G R 4 . Clearly the conjugate of the conjugate of re is a; 
again, and one can verify that 

(4.33) (xy) = yx 
for all quaternions x and y. As a result, 

(4.34) \xy\ 2 = (xy) (xy) = xyyx = \x\ 2 \y\ 2 

for all quaternions x, y, i.e., \xy\ = |rc| \y\. 

In the third situation above, A' is isomorphic to the quaternions in a 
natural sense. One can think of the identity operator in A 1 as corresponding 
to the real number 1 in the quaternions, and in general the real multiples of 
the identity operator in A correspond to the real numbers inside the quater¬ 
nions. The antisymmetric operators in A correspond to the “imaginary” 
quaternions, which are the quaternions of the form bi + cj + dk. There 
are not necessarily specific counterparts for i, j, and k in A , but one can 
use any antisymmetric operators R\ , i? 2 , and R 3 such that R 3 = R 1 R 2 , 
(i?i, R\) = (R- 2 , R 2 ) = 1, and (R 3 ,R 2 ) = 0 (as in the earlier discussion). 

This leads to a one-to-one correspondence between A and the quaternions 
that preserves sums and products. Similarly, the conjugation operation x 
x on the quaternions matches with the adjoint operation T 1 —> T* on A . The 
norm on A coming from the inner product (•, •) matches with the norm on 
quaternions. 

In this manner the operators in A define a quaternionic structure on 
V. That is, a complex structure on a real vector space gives a way to have 
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complex numbers operate on vectors in the vector space, and now we have a 
way for the quaternions to operate on a vector space, in a way that extends 
the usual scalar multiplication by real numbers. This quaternionic structure 
is also compatible with the inner product on V , in the sense that a quaternion 
with norm 1 corresponds to an operator on V which preserves the inner 
product there. This uses (|4.11|) . 

Theorem |2.57| says that A = A ", so that A consists exactly of the real- 
linear operators on V that commute with the operators that come from the 
quaternionic structure (i.e., the operators in A' in this case). In other words, 
A consists exactly of the operators which are “quaternionic-linear” on V 
with respect to this quaternionic structure coming from A', analogous to the 
second situation and complex-linear operators with respect to the complex 
structure on V that arose there. 


5 Division algebras of operators 

Definition 5.1 Let B be an algebra of operators on a vector space V over 
a field k. Suppose that for each T in B, either T = 0 or T is an invertible 
operator on V (so that T l lies in B, by Proposition \2.3j ). In this case B is 
said to be a division algebra of operators. 

For the rest of this section we assume that k is a field, V is a vector space 
over k, and B is a division algebra of operators on V. 

If vi ,..., v m are vectors in V, then we say that they are B-independent if 

(5.2) T\(v i) + • • • + T m (v m ) = 0 
implies that 

(5.3) Ti — ■ ■ ■ — T m — 0 

for all 7),..., T rn in B. The B-span of v x ,..., v m is defined to be the subspace 
of V consisting of vectors of the form 

(5.4) T\(v i) + • • • + T m (v m ), 

where Ti,... ,T m lie in B. Thus ^-independence of Vi,... ,v m is equivalent 
to saying that if a vector can be represented as (|5.4| ) with Ti,... ,T m e B, 
then this representation is unique. 

Note that a single nonzero vector in V is ^-independent. This uses the 
assumption that an operator in B is either 0 or invertible. 
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Lemma 5.5 Assume that V\,... ,v m are B-independent vectors in V. If w 
is a vector in V that does not lie in the B-span ofv i,..., v m , then the vectors 
Vi ,..., v m , w are B-independent. 

Indeed, suppose that Ti,, T m , U are operators in B such that 

(5.6) Ti(vi)-{ - hT m (v m ) + U(w) = 0. 

We would like to show that T\ — ■ ■ ■ — T m — U — 0. If U = 0, then this 
reduces to the ^-independence of v\,... ,v m . If U ^ 0, then U is invertible, 
by our assumptions on B , and we can rewrite ( |5.6|) as 

(5.7) w = -(I /- 1 o HXm)-(I /- 1 O T m )(v m ). 

This implies that w is in the /5-span of v±,... ,v m , contrary to hypothesis, 
and Lemma |5.5| follows. 

Corollary 5.8 There are vectors ui,...,ue in V which are B-independent 
and whose B-span is equal to V. 

To see this, one can start with a single nonzero vector u± in V. If the 
/5-span of this vector is all of V, then we stop. Otherwise, we choose a vector 
U 2 which is not in the /5-span of U\. Lemma |5.5| implies that U\,U 2 are B- 
independent. If the /5-span of Ui,u 2 is all of V, then we stop, and otherwise 
we choose u 3 to be a vector in V not in the /5-span of ui,U 2 - Proceeding in 
this manner, one eventually gets a collection of vectors u\, ..., in V as in 
the corollary. 

Corollary 5.9 The dimension of V is divisible by the dimension of B (as 
vector spaces over k). 

Specifically, the dimension of V is equal to I times the dimension of B, if 
I is as in the previous corollary. 

Corollary 5.10 Suppose that W is a nonzero vector subspace ofV which is 
invariant under B. There are vectors u\, ...,«£ in V and a positive integer 
m such that ui,... ,ue are B-independent, the B-span of U\,... ,ug is equal 
to V, Uj lies in W exactly when j < m, and W is equal to the B-span of the 
Uj’s, j < m. 
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This can be established in the same manner as the Corollary |5.8| . The 
main difference is that in the beginning of the construction one chooses the 
Uj s in W, for as long as one can. One stops doing this exactly when there 
are enough u/s so that their /5-span is W. One then continues as before, 
with additional u /s as needed so that the /5-span of the total collection of 
vectors is V. 


Corollary 5.11 IfW is a nonzero vector subspace ofV which is invariant 
under B and which is not equal to V, then there is a vector subspace Z of 
V which is invariant under B and which is a complement of W (so that 
W HZ = {0} andW + Z = V). 

For this one can take Z to be the /5-span of u m+ 1 ,..., ui, where u±,... ,ui 
is as in Corollary |5.1C . 

Now let us consider B'. Let ui,...,ug be vectors in V which are B- 
independent and whose /5-span is equal to V, as in Corollary |5.^ . If R G B', 
then R is determined uniquely by R(u ±),..., R(u(). For if T G B, then 
R{T{uj )) = T(R(uj )) for each j, and this and linearity determine R on all 
of V. 

Conversely, if uq,..., wn are arbitrary vectors in V, then there is an R G 
B' so that R{uj) = Wj for each j. One can simply define R on V by 

(5.12) R(Ti(u\) + • ■ • + Tg{ug)) = Ti(w\) + • • • + Tn{wf), 

for all Ti,..., Ti G B. It is easy to check that this gives a linear operator R 
on V which lies in B'. 


Lemma 5.13 The dimension of B' is equal to the square of the dimension 
ofV divided by the dimension of B, as vector spaces over k. 

This follows from the preceding description of B'. 

Proposition 5.14 B" = B. 


This can be verified directly, using our characterization of B'. Let us 
mention another argument, in terms of the methods of Section [2|. It is enough 
to show that B is a very nice algebra of operators, by Theorem |2.57| in 
Subsection Using Lemma |2.10| in Subsection |2.2| and Corollary |5.11| one 
obtains that B is a nice algebra of operators (Definition [2]7]) . To say that B 


45 












is very nice means (Definition |2.56|) that the expanded algebras B n defined 
in Subsection [2(6] are all nice. By construction, the expanded algebras B n 
are all division algebras of operators, and hence they are nice for the same 
reason that B is. 

Next we discuss some analogues of the results in Subsection fh 8 [ . Let us 
make the standing assumption for the rest of the section that 


(5.15) 


A is an algebra of operators on V such that AC. B'. 


Lemma 5.16 If the dimension ofV is equal to the dimension ofB, as vector 
spaces over k, then A = B' if and only if A is irreducible. 


The assumption that the dimension of V is equal to the dimension of B 
implies that V is equal to the B-span of any nonzero vector in V. Using this 
and the reformulation of irreducibility in Lemma [L^ it is easy to check that 
A = B' when A is irreducible. For the converse, notice that B' is always 
irreducible, without any restriction on the dimension of V. 


Lemma 5.17 If A is irreducible and there is a nonzero operator in A whose 
image is contained in the B-span of a single vector, then A = B'. 


The assumptions in this lemma are also necessary for A to be equal to 
B'. 

For each nonzero vector u in V, consider the vector space of linear oper¬ 
ators on V which lie in B' and have image contained in the B-span of u. We 
would like to show that this vector space of linear operators is contained in 
A. If we can do this for any nonzero u in V, then it follows that A = B'. 

Fix u 7 ^ 0 in V. Observe first that there is a nonzero linear operator on V 
which lies in A and has image contained in the B-span of u. The hypothesis 
of the lemma says that this is true for at least one iq, and one can get any 
other u by composing with an operator in A that sends rq to u. Such an 
operator exists, by irreducibility. 

As a vector space over k , the collection of operators in B' whose image 
is contained in the B-span of u has dimension equal to the dimension of V. 
This can be derived from the earlier description of B'. Thus it is enough 
to show that the collection of operators in A with image contained in the 
B-span of u has dimension which is at least the dimension of v. Let R be a 
nonzero operator in A whose image is contained in, and hence equal to, the 
B-span of u, and let z be a vector in V such that R(z) = u. For each vector 
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v in V there is a linear operator T in A such that T(y) = z. For this T we 
have that RoT lies in A, the image of Ro T is contained in the i5-span of u, 
and (RoT)(v) = u. Because we can do this for each v in V, it is not hard to 
see that the dimension of the collection of linear operators in A with image 
contained in the /5-span of u has dimension at least the dimension of V, as 
desired. 


Lemma 5.18 Suppose that A is irreducible, and that for every four vectors 
Vy, V‘ 2 , vj\, W 2 in V such that v\, u 2 are B-independent there is a T in A 
which satisfies T(v i) = w\, T(u 2 ) = u> 2 . Then A = B'. 


The assumption of irreducibility of A follows from the second condition 
when the dimension of V is larger than the dimension of B. When the two 
dimensions are equal, the second condition is vacuous, and one could just as 
well apply Lemma |5.16| . 

To prove Lemma 5.IS , it suffices to show that there is a nonzero operator 
in A whose image is contained in the /5-span of a single vector, by Lemma 
|5.17| . Let R be any nonzero operator in A. If the image of R is not contained 
in the /5-span of a single vector, then there exist aq, x 2 in V such that -R(aq), 
R(xf) are B- independent. By hypothesis, there is an operator T in A such 
that T(R(x i)) 7 ^ 0 and T(i?(;c 2 )) = 0. Thus T o R lies in A, is nonzero, 
and the dimension of its image strictly less than that of R. If the image of 
T o R is contained in the /5-span of a single vector, then we are finished, and 
otherwise we can repeat this process until such an operator is obtained. This 
proves the lemma. 


Lemma 5.19 Suppose that A! = B and A n C £( V n ) (defined in Subsection 
HTtf ) is a nice algebra of operators when n = 2. Then A = B' = A!'. 

The hypotheses of this lemma imply those of Lemma |5.1£ . 


6 Group representations and their algebras 

Throughout this section, G will be a finite group, and k will be a held. 

Proposition 6.1 Let Vj and V 2 be vector spaces over k, and let p 1 , p 2 be 
representations of G on Vj, Vj. Suppose that T : Vj —> Vj is a linear mapping 
which intertwines the representations p 1 , p 2 , in the sense that 

(6-2) T o pi = p 2 x o T 
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for all x in G. If the representations p 1 , p 2 are irreducible, then either T is 
0, or T is a one-to-one mapping from V\ onto V 2 , and the representations 
p 1 , p 2 are isomorphic to each other. 

This is one half of Schur’s lemma. (The other half came up in Section f|.) 

Let Vi , V 2 , etc., be as in the statement above. Consider the kernel of 
T , which is a vector subspace of . From the intertwining property it is 
easy to see that the kernel of T is invariant under the representation p 1 . 
The irreducibility of p 1 then implies that the kernel is either all of Vi or the 
subspace consisting of only the zero vector. In the first case T = 0, and we 
are finished. In the second case we get that T is one-to-one. 

Now consider the image of T in V 2 . This is a vector subspace that is 
invariant under p 2 , because of the intertwining property. The irreducibility 
of p 2 implies that the image of T is either the subspace of V 2 consisting of 
only the zero vector or all of V 2 . This is the same as saying that T is either 
equal to 0 or it maps V\ onto V 2 . Combining this with the conclusion of 
the preceding paragraph, we obtain that T is either equal to 0, or that it is 
one-to-one and maps V± onto V 2 . This proves the proposition. 

For the rest of this section, let us assume that 

(6.3) either the characteristic of k is 0, or it is positive and 

does not divide the number of elements of the group G. 


Let V be a vector space over k, and let p be a representation of G on 
V. As in Lemma |1.43| in Subsection |1.5| . there is an independent system of 
subspaces W±, ..., Wh of V such that the span of the W/ s is equal to V, each 
Wj is invariant under p, and the restriction of p to each Wj is irreducible. 
Let us make the following additional assumption: 


(6.4) for each j, l such that 1 < j,l < h and j 7 ^ l, the 

restriction of p to W 3 is not isomorphic to the 
restriction of p to Wi, as a representation of G. 


Notation 6.5 Suppose that A is an algebra of operators on a vector space 
V. If U is a vector subspace of V which is invariant under A (Definition j 2 . 6 | ), 
then we let A(U) denote the set of operators on U which are the restrictions 
to U of the operators in A. It is easy to see that A(U) is an algebra of 
operators on U. The commutant and double commutant of A(U) in C{U) 
are denoted A{U)' and A(U)". 
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In our present circumstances, we let A denote the algebra of operators 
on V generated by p as in Subsection pTd) . A subspace U of V is invariant 
under A if and only if it is invariant under p, and in this case A(U) is the 
same as the algebra generated by the restrictions of the pA s to U, x in G. 


Theorem 6.6 Under the preceding conditions, A consists of the linear op¬ 
erators T on V such that T(Wf) C W 3 and the restriction ofT to Wj lies in 
A(Wj) for each j, 1 < j < h. 

It is clear from the definitions that each T E A satisfies the properties 
described in the theorem. The key point is that in the converse, the restric¬ 
tions of T to the various Wj s can be chosen independently of each other. To 
prove this, we use Theorem |2.57| . 

Because W ±,..., Wh is an independent system of subspaces of V which 
spans V, there is for each j — 1,..., h a natural linear operator P 3 on V which 
projects V onto W 3 . Specifically, Pj is the linear operator which satisfies 
Pj(u ) = u when u E Wj and Pj(z) = 0 when z EWi, l A j • Thus Pj oP| = 0 
when j A l, and Ylj=i Pj — I, the identity operator on V. We also have that 
each Pj lies in A', because the W 3 's are invariant subspaces. 

Lemma 6.7 If S E A 1 and 1 < j,l < h, j A then Pi o S o Pj = 0. 

Let S, j, and l be given as in the lemma, and set S 3 ,[ — Pi o S o Pj. Note 
that Sjj lies in A', since S, Pj, and Pi do. Thus 

(6.8) Sfi o p x p x o S 3 / 

for all x in G (and this is equivalent to the statement that S 3 j E A! , since A 
is generated by the pA s). 

Let Rjj be the linear mapping from Wj to Wi which is the restriction of 
Sjj to Wj. Thus Rjj is essentially the same as S 3: i, except that it is formally 
viewed as a mapping from Wj to Wi rather than as an operator on V. On 
Wj and W] we have representations of G, namely, the restrictions of p to W 3 
and Wi. By hypothesis, these representations are irreducible and not isornor- 
phically equivalent. The mapping Rjj intertwines these representations, by 
(|6.8| ). Therefore, R, 3 j = 0, by Proposition ^T], This is equivalent to S 3 j = 0, 
and the lemma follows. 


Lemma 6.9 A linear operator S on V lies in AI if and only if S{Wj) C Wj 
and the restriction of S to Wj lies in AiWjf for each j, 1 < j < h. 
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If S £ A, then S(Wj) C Wj for each j by Lemma^. To say that S £ A' 
means that Z o S = S o Z for all Z £ A, or 


( 6 . 10 ) 


(ZoS)(v) = (SoZ)(v) 


for all Z £ A and all vectors v in V. Because V is spanned by W \,..., Wh, 
( |6.10| ) holds for all v in V if and only if it holds for all v in Wj for each 
j = 1,. .., h. If v £ Wj , then S(v ) £ Wj and Z(v) £ Wj for all Z £ A. Let 
Sj and Zj denote the restrictions of S and Z to Wj. Then we get that ( |6.10| ) 
holds for all v in V and all Z in A if and only if 


(6.11) (Z j °S j )(v) = (S j °Z j )(v) 

for all v in Wj, all Z in A, and all j = 1 ,,h. This is the same as saying 
that for every j the restriction of S to Wj commutes with the restriction of 
every element of A to Wj, or, equivalently, with every element of A(Wj). 
This completes the proof of the lemma. 


Lemma 6.12 A linear operator T on V lies in A" if and only ifT(Wj) C Wj 
and the restriction ofT to Wj lies in A{Wj)" for each j. 


We know that each Wj is an invariant subspace of T £ A' because each Pj 
lies in A . This is essentially the same as applying Lemma pil in Subsection 
One can finish the argument in essentially the same way as for Lemma 

it 

Theorem |6.6| can be derived from Lemma |6. 1 2| . Alternatively, it is enough 
to show that the operators T on V which satisfy the conditions in Theorem 
|6.6| should lie in A, and for this it is enough to show that they commute with 
every element of A (so that they then lie in A' , which is equal to A by the 
Theorem [2.57| in Subsection |2.7|). This is an easy consequence of Lemma | 


7 Basic facts about decompositions 


Throughout this section G is a finite group, and k is a field which is assumed 
to have characteristic 0 or positive characteristic which does not divide the 
number of elements in G. 

Suppose that p is a representation of G on a vector space V over k. As in 
Lemma |1.43| in Subsection [0|, there is an independent system of subspaces 
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Wi,... ,Wh of V such that V = span(Wi,..., Wh), each Wj is invariant 
under p, and the restriction of p to each Wj is irreducible. We do not ask 
that the restrictions of p to the Wj s be isomorphically distinct. 

Lemma 7.1 In addition to the preceding conditions, assume that U is a 
nonzero vector subspace of V which is invariant under p and for which the 
restriction of p to U is irreducible. Let I denote the set of integers j, 1 < 
j < h, such that the restriction of p to Wj is isomorphic to the restriction of 
p to U. Then I is not empty, and 

(7.2) U C span{Wj : j G /}. 

For each j = 1,..., h, let Pj : V —> V be the projection onto Wj that 
comes naturally from the independent system of subspaces W \,..., Wh, he., 
Pj(u ) = u when u G Wj and Pj(z) = 0 when z G Wi, l ^ j. Thus Yilj=\ Pj — I 
and each Pj commutes with the operators p x , x in G, from the representation, 
because the Wfa are invariant under the representation. 

Define Tj : U —> Wj for j — 1,..., h to be the restriction of Pj to U, now 
viewed as a mapping into Wj. Notice that Tj intertwines the representations 
on U, Wj which are the restrictions of p to U, Wj, since Pj commutes with the 
operators p x , x in G. Because the restrictions of p to U, Wj are irreducible, 
we have that each Tj is either equal to 0 or is a one-to-one linear mapping 
onto Wj, by Proposition 

If J denotes the set of j's such that Tj ^ o, then 

(7.3) U C span{lFj : j G J}, 

simply because Pj is equal to 0 on U for j (f J. In particular, J is not equal 
to the empty set, since U ^ {0}. We also have that J C I, since Tj : U —> Wj 
defines an isomorphism between the restriction of p to U and the restriction 
of p to Wj when j G J. This proves Lemma 0 

Now suppose that Tj,..., Y m are nonzero subspaces of V with properties 
analogous to those of W\,..., Wh, namely, Yi,..., Y m is an independent sys¬ 
tem of subspaces of V such that V = span(Yi,..., Y m ), each Y) is invariant 
under p, and the restriction of p to each Yj is irreducible. In general it is 
not true that the Yj 's have to be the same subspaces of V as the ftp’s, after 
a permutation of the indices. For instance, if p is the trivial representation, 
so that each p x is equal to the identity mapping on V, then any subspace 
of V is invariant under p, and any 1-dimensional subspace has the property 
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that the restriction of p to it is irreducible. If the dimension of V is strictly 
larger than 1, then there are numerous ways in which to decompose V into 
1-dimensional subspaces. 

However, it is true that the two decompositions of V have to be nearly 
the same in some respects. Let J be a nonempty subset of {1 ,,h} such 
that the restriction of p to Wj is isomorphic to the restriction of p to Wj when 
j, l G J, and such that the restriction of p to Wj is not isomorphic to the 
restriction of p to Wj when j G J and l fL J. Let L be the set of / G {1, ..., m} 
such that the restriction of p to Wj is isomorphic to the restriction of p to Y; 
when j G J. Then 


(7.4) span{lLj : j G J} = span{Yj : l G L}. 

This is because Y] C span (My, : j G J} when l G L, by Lemma |7T| , and 
similarly Wj C span{Y) : l G L} when j G J. 

Because of the isomorphisms, the Wfs for j G J and the Yf s for l G L 
all have the same dimension. We also have that 


(7.5) dim span {My : j G J} = dim Wj 

j&J 

and 

(7.6) dimspan{y ; : l G L} = ^ dim 1), 

leL 

since the W/'s and Y {s form independent systems of subspaces. Using (|7.4|) 
we obtain that J and L have the same number of elements 

We can do this for all subsets J of {1,..., h} with the properties described 
above, and these subsets exhaust all of {1,..., h}. The corresponding sets 
L C {1,..., m} exhaust all of {1,..., m} as well, because of Lemma m To 
summarize, the same irreducible representations of G occur as restrictions 
of p to the Wj ’s as occur as restrictions of p to the Yds, up to isomorphic 
equivalence, and they occur the same number of times. In particular, we 
get that h = m. Although the various subspaces do not have to match up 
exactly, we do have ( [7.4|) . 

Now let Z be a vector space over k, and a be an irreducible representation 
of G on Z. Let F(G ) denote the vector space of fc-valued functions on G. 

Lemma 7.7 Suppose that X is a nonzero linear functional on Z, i.e., a 
nonzero linear mapping from Z into k. For each v in Z, consider the function 
f v (y) on G defined by 

(7-8) f v {y) = X((Jy-i(v)). 
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Define U C F(G) by 

(7.9) U = {. f v {y ) : v E Z}. 

Then U is a vector subspace of F(G) which is invariant under the left regular 
representation (Subsection M), and the mapping v t—> f v is a one-to-one 
linear mapping from Z onto U which intertwines the representations a on 
Z and the restriction of the left regular representation to U. In particular, 
these two representations are isomorphic. 

Clearly v i—> /,, is a linear mapping from Z onto U, so that U is a vector 
subspace of F(G). Let us check that the kernel of this mapping is the zero 
subspace of Z. 

Suppose that v is a vector in Z such that f v (y) is the zero function on 
G. This is the same as saying that A(cr y -i(u)) = 0 for all y in G. Let W be 
the subspace of Z which is spanned by cr y (v), y G G. Then W is invariant 
under the representation er, because of the way that it is defined. If v 0, 
then W is not the zero subspace, and indeed v lies in W since a e (v ) = v. 
The irreducibility of a then implies that W — Z. On the other hand, A is 
equal to 0 on W, and we are assuming that A is a nonzero linear functional 
on Z. Thus we conclude that v — 0. This shows that the kernel of v f v is 
the zero subspace of Z, and hence that this mapping is one-to-one. 

Let L x , x G G, denote the left regular representation of G, as in Subsec¬ 
tion E3 For each x in G and v in Z we have that 

(7.10) L x (f v )(y) = f v (x~ l y) = X(a y -i x (v)) 

= X(a y -i(a x (v))) =f a ^ v ){y)- 

This says exactly that the mapping v i—> f v intertwines the representations a 
on Z and the left regular representation on F(G). In particular, U is invariant 
under the left regular representation. Lemma O follows easily from these 
observations. 

Remark 7.11 Lemma |7.7| and its proof work for any field k, without any 
assumption on the characteristic of k. 

Proposition 7.12 Suppose that for each x in G an element a x of the field 
k is chosen, where a x ^ 0 for at least one x. Then there is a vector space Y 
over k and an irreducible representation r on Y such that the linear operator 

(7.13) J2 a * T * 

xeG 
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on Y is not the zero operator. 


Suppose first that we do not worry about having r be an irreducible 
representation. Then we can use the left regular representation on the space 
of /e-valued functions on G. The operator in question is 

(7.14) 5> X L X . 

x£G 


Let (j) e (z) denote the function on G defined by (j) e (z) = 1 when z = e and 
4> e (z) = 0 when z ^ e. Then 

(7.15) ) ( a x L x (0e) (w) a w 

xeG 


for all w in G. The assumption that the a x ’s are not all 0 says exactly that 
this is not the zero function on G, and hence that (|7.14| ) is not the zero 
operator on the space of functions on G. 

From Lemma 1.43 in Subsection L5 we know that the vector space of de¬ 
valued functions on G is spanned by subspaces which are invariant under the 
left regular representation, and to which the restriction of the left regular 
representation is irreducible. It follows easily that the restriction of the 
operator ( |7.14|) to at least one of these subspaces is nonzero, since it is not 
the zero operator on the whole space of functions. To get Proposition |7 .1 2| , 
we take r to be the restriction of the left regular representation to such a 
subspace. 

Now let V be a vector space over k, and let p be a representation of G 
on V of the following special type (which is unique up to isomorphism). We 
assume that V has an independent system of subspaces Hq,..., W rn such 
that V = span (IF ( ,..., W m ), each Wj is invariant under p, the restriction of 
p to each Wj is an irreducible representation of G, the restriction of p to Wj is 
not isomorphic to the restriction of p to IF when j ^ l, and every irreducible 
representation of G on a vector space over k is isomorphic to the restriction of 
p to some Wj, 1 < j < m. In other words, this representation is isomorphic 
to a direct sum of irreducible representations, in which every irreducible 
representation of G on a vector space over k appears exactly once, up to 
isomorphism. One can also think in terms of starting with the left regular 
representation of G (over k) and passing to a suitable invariant subspace, to 
avoid repetitions of irreducible representations (up to isomorphism). 

Let A denote the set of linear operators on V associated to the represen¬ 
tation p, as in Subsection 1 2A. 
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Lemma 7.16 A linear operator T e £(V) lies in A if and only if it can be 
written as 

(7.17) T=J2a*Px, 

xeG 

where each a x lies in k. Each operator T e A can be written as fl 7.l r \ ) in a 
unique way. In particular, the dimension of A as a vector space is equal to 
the number of elements of G. 


The first statement is a rephrasal of the definition of A given at the 
beginning of Subsection [ 


The uniqueness of the expression for T in (|7.17|) 
follows from Proposition 7.12 , since all irreducible representations of G occur 
as restrictions of p to a subspace of V. The last assertion in the lemma is an 
immediate consequence of the first two. 

Because the restrictions of p to the Wf s are isomorphically distinct, The¬ 
orem |6.6| leads to rather precise information about how A looks in this situ¬ 
ation. 

If the field k is algebraically closed, such as the held C of complex num¬ 
bers, then the algebra of operators generated by an irreducible representation 
is the algebra of all linear operators on the corresponding vector space. For 
this we use Proposition |4.5| to say the commutant of the algebra consists of 
only the scalar multiples of the identity, and then Theorem 2.571 hr Subsection 
It follows that the number of elements of G is equal to the sum of the 
squares of the degrees of the isomorphically-distinct irreducible representa¬ 
tions of G in this case. Note that there is always an irreducible representation 
of degree 1, namely the representation of G on a vector space of dimension 
1 in which each x in G is associated to the identity operator. 

Let us look now at the center of A, which is the set of operators in A 
which commute with all other elements of A. 


Lemma 7.18 An operator T = ffx&G a x Px •A lies in the center of A if 
and only if a x = a y whenever x and y are conjugate inside the group G, i.e., 
whenever there is a w in G such that y = wxw -1 . 

Indeed, T commutes with all elements of A if and only if T commutes 
with p z for all z in the group. This is the same as saying that 

(7.19) p z ( ^ ( &xPx^Pz ) ) O'x 

x£G xGG 
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for all z in G. This can be rewritten as 


(7.20) 'y ( d x Pz _1 XZ y ) &X Pxi 

x^G x£G 

or as 

(7.21) y ) dzxz -1 Px y ) Q'xPx- 

xGG xGG 

In other words, the mapping from x to zxz~ l permutes the elements of the 
group, and in the last step we made a change of variables in the sum using 
this permutation. We conclude that T lies in the center of A if and only if 
a zxz -i — a x for all x and 2 in the group. This proves the lemma. 

If we write x ~ y when x and y are conjugate elements of the group, 
then this defines an equivalence relation on the group, as is well-known and 
easy to verify. The equivalence classes for this relation are called conjugacy 
classes. The lemma can be rephrased as saying that T = J2 x eG a x Px lies in 
the center of A if and only if the coefficients a x are constant on the conjugacy 
classes of G. 

Note that the center of A is automatically a subalgebra, and a vector 
subspace in particular. 

Corollary 7.22 The dimension of the center of A is equal to the number of 
conjugacy classes in the group. 

This is an easy consequence of the previous remarks. 

If the field k is algebraically closed, then it follows that the number of 
conjugacy classes in the group is equal to the number of isomorphically- 
distinct irreducible representations of the group. This uses Proposition |4.5| . 

8 p- adic numbers 

Let Q denote the held of rational numbers, and fix a prime number p. 

The p-adic absolute value \ ■ \ p on Q is defined as follows. If x = f p k , 
where a, b , and k are integers, with a, b 7^ 0 and neither a nor b divisible by 
p, then we set 

(8.1) \x\ P = p~ k . 

If x = 0, then we set \x\ p = 0. 


56 


The p-adic absolute value | • \ p enjoys many of the same features as the 
classical absolute value | • |, dehned by \x\ = x when x > 0 and \x\ = —x 
when x < 0. In particular, 

(8.2) \x + y\ p < \x\ p + \y\ p 
and 

(8.3) \xy\ p = \x\p\y\p 

for all x, y G Q. In fact, instead of (|8.2| ) there is a stronger “ultrametric” 
version of the triangle inequality, which states that 


(8.4) 


\x + yip < max(\x\ r , |t/|„), 


whose analogue for the standard absolute value | • | is not true. It is not 

), just from the definitions. 


difficult to verify (|8.3|) and 

Just as for the standard absolute value, we can define a p-adic distance 
function d p (x,y ) on Q using | • | p , by 


(8.5) 


d P (x,y) = \x- y\ } 


This satisfies the usual requirements for a metric (as in [|Rud|| ), which is to 
say that d p (x, y) is a nonnegative real number which is equal to 0 if and only 
if x = y, d p (x,y ) = d p (y,x), and d p (x,y ) satisfies the triangle inequality. In 
this case we have the stronger condition 


(8.6) d p (x, z ) < rna x(d p (x, y), d p (y, z)) 

for all x,y,z G Q, because of (|8.4D . Thus one says that d p (x,y) is an ultra¬ 
metric, referring to this stronger version of the triangle inequality. 

Recall that if (M, D(u, v)) is any metric space, then a sequence {u}^! of 
points in M is said to converge to a point u in M if for every positive real 
number e there is a positive integer L such that 


(8.7) D(uj,u ) < e for all j > L. 

Similarly, a sequence {vj}^ =1 in M is said to be a Cauchy sequence if for 
every e > 0 there is an L > 0 so that 


( 8 . 8 ) 


D{vj, Vk) < e for all j , k > L. 
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It is easy to see that every convergent sequence is also a Cauchy sequence. 
It is not true in general that every Cauchy sequence converges to some point 
in the metric space, but this is true in some situations. 

A metric space (M,D(u,v)) is said to be complete if every Cauchy se¬ 
quence in M also converges in M. A classical example is the real line R, 
equipped with the standard metric \x — y\, as in 
numbers equipped with the standard metric \x — y\ is not complete, because 
a sequence in Q which converges as a sequence in R is always a Cauchy 
sequence, whether or not the limit lies in Q. 

By a completion of a metric space (M, D(u,v)) we mean a metric space 
(Mi, Di(u, v)) together with an embedding 9\ : M —> Mi with the following 
three properties: (a) the embedding is isometric, in the sense that 

(8.9) Di(9i(u),9i(v)) — D(u,v) for all u,v E M] 


iR.udjl . The set Q of rational 


(b) the image of M in Mi under 9\ is a dense subset of Mi, so that 

(8.10) for every w G Mi and e > 0 there is 

a u E M such that Di(w, 9i(u)) < e; 


and (c) the metric space (Mi, Di(u,v)) is itself complete. A completion of a 
metric space always exists, as in Problem 24 on p82 and Problem 24 on pl70 
of ]RTT7T|| . If (Mi, Di(u, v)), 9 X : M -»■ M, and (. M 2 ,D 2 (z,w )), 9 2 : M -> M 2 
are two completions of the same metric space (M, D(x,y)), then there is an 
isometry 0 from Mi onto M 2 such that 9 2 — 0 ° 9i. Indeed, one can define 
0 on the dense subset 9i(M) of Mi through this equation, and then extend 
0 to all of Mi using uniform continuity and completeness (as in Problem 13 
on p99 of [ [Rudf l ). 

The standard embedding of Q in R provides a completion of Q equipped 
with the standard metric \x — y\. What about Q equipped with the p-adic 
metric d p (x,y)l Let us begin with a representation for rational numbers 
connected to the p-adic geometry. 


Lemma 8.11 Let x be a nonzero rational number, and let l be the nonzero 
integer such that \x\ p = p~ l . There is a sequence {ajof nonnegative 
integers such that eg 0, aj < p — 1 for all j, and the series 

OO 

(8.12) EWP’ 

3=1 

converges to x in Q with respect to the p-adic metric. 
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In other words, if 
(8.13) 

for n > l, then d p (s n , x ) 


Sn = E a 3 P* 

3=1 

0 as n —>■ oo. In fact, 


(8.14) 


d r (s n ,x)<p 


—n —1 


for each n> l. 


We can use ( |8.14| ) to choose the a /s one after the other. Let us start with 
cq. Since |x| p = p~ l , we can write x as ( a/b)p l , where a and b are nonzero 
integers which are not divisible by p. We choose cq, 1 < cq < p — 1, so that 
a = aib modulo p. This ensures that d p (si,x ) < p~ l ~ l , where si = aip 1 . 

Now suppose that aq has been chosen for j = /, /+1,..., n for some integer 
n, in such a way that ( |8.14|) holds, and let us choose a n+ i. Since d p (s n ,x ) < 
p~ n_1 , we can write x — s n as (a'/b') p n+1 , where a' and b' are integers, and 
b' is not zero and not divisible by p. We choose a n+ 1 , 0 < a n+ \ < p — 1, 
so that a' = a n +\ b' modulo p. This ensures that d p (s n+ i,x) < p~ n ~ 2 , with 


s n+ i = s n + a n+1 p n+1 , as in 

This gives a sequence { otj as required in Lemma p.ll| . We shall explain 
another way to get such a sequence in a moment, using the next lemma. 


(EH- 


Lemma 8.15 Let l be an integer, and let be a sequence of arbitrary 

nonnegative integers. There is a sequence of nonnegative integers 

such that aij < p — 1 for all j and 

n n 

(8.16) X! ~ X/L 

3=1 j=l 

can be written as c n p n+1 with c n a nonnegative integer for all integers n > l. 


This can be derived in much the same manner as before. We choose cq 
so that it is equal to (3i modulo p. Suppose that aq, l < j < n, have the 
properties described in the lemma, and let us choose a n +\. By assumption, 

n n 

(8.17) c n = (E PiP* ~ E “jP 7 ') 

3=1 3=1 

is a nonnegative integer, and we choose a n+ 1 , 0 < a n+ i < p — 1, so that it is 
equal to c n + /3 n+1 modulo p. This implies that 

n+1 n+1 

(8.18) c n+ 1 = p~ n ~ 2 (E A^ - E a 3 P 1 ) 

j=l j=l 
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is also a nonnegative integer, as desired. 

In the context of Lemma |8.15| , if YlpLi Pj P* converges to a rational number 
in the p-adic metric, then YPpLi otj P J converges to the same rational number 
in the p-adic metric, since the difference between the partial sums tends to 
0 in the p-adic absolute value. 

Let us return to the setting of Lemma |8 .1 1|. Let x be a nonzero rational 


number, and assume first that a; is a positive integer. Then we can write x 


is a nonnegative integer such 


as 

n 

(8.19) X = y Qq p J , 

1=0 

where n is a nonnegative integer and each aq 
that 0 < otj < p — 1. One can begin by choosing n as large as possible so 
that p n < x, and then take a n so that 0 < x — a n p n < p n . Afterwards 
the remaining part x — a n p n can be treated in a similar manner, and so 
on. Alternatively, op is uniquely determined by the requirement that x = op 
modulo p, is determined by x — op = ®iP modulo p 2 , etc. 

More generally, if x is a positive integer times p l for some l E Z, then 
there is a finite expansion for x as in Lemma |8 .1 1| . Of course we can write x 
in this way where the integer factor is not divisible by p, so that the leading 
term in the expansion is nonzero. 

What about negative numbers? Consider x — —1, for instance. We can 
write —1 as (p — 1)/(1 — p), and this leads to the expansion 


(8.20) -1 = £(P-1)PL 

i=o 

This is a kind of geometric series, which would not converge in the classical 
sense, because p > 1. However, this series does converge to —1 with respect 
to the p-adic metric. Indeed, by the usual formula, 

n n | _ „n+1 

(8- 21 ) Y,('P- l )P‘ = - (P“ !)—;-= -1 ~P n+ \ 

1=0 1=0 l ~P 

so that 

n 

(8.22)d r (£(p-l)p*, -l) = (-1 -p” +1 ) - l| p = | -P n+1 \ r = p-"- 1 . 

1=0 


More generally, suppose that x = —1/6, where 6 is a positive integer 
which is not divisible by p. Let c be a positive integer such that c < p — 1 
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and —be = 1 modulo p. Thus x = — 1/5 = c/( 1 — bip), where b\ is a positive 
integer. As in the preceding situation, we get that 

OO 

(8.23) x = ^cb{p j , 

3=0 

with convergence in the p-adic metric, because 

n n i _ (h, r >) n+ 1 

(8.24) 51 cb iP 3 = c J2( b iP) J = c — 7—7 -= x{l - (bip) n+1 ). 

3=0 j= 0 1 P 

This expansion does not quite fit the conditions in Lemma |8.11| , because 
the coefficients cb{, which are positive integers, are not bounded by p — 1 in 
general. However, one can use Lemma |8. 15| to convert this expansion into 
one that does satisfy the additional restriction on the coefficients. 

Before proceeding, let us note the following. 

Lemma 8.25 Let l, l 1 be integers, and let {/d,}^ and be sequences 

of nonnegative integers. Define a sequence { 7 i}iZi + i' by 

(8.26) 7 i = £{&& ■i = j + k,j>l,k>l'} 

(i.e., { 7 i}ih + i' is the “Cauchy product” of {ffi jfLi and Then, for 

each integer n > 0, 

l+n l'-\-n Z+Z'+n 

(8.27) (£/%?*)(£&p‘)- E 

j=l k=l' 

is a nonnegative integer which is divisible by p l+l ' +n+1 . 


To see this, one might prefer to think about the case where l = V = 0, 
to which one can easily reduce. The main point is that every term occurring 
in the sum of the 7 /s in ( |3.27|) also occurs when one expands the product 
of the two sums in ( |8.27| ). There are additional terms in the product of the 
sums, which are all of the form cp m , where c is a nonnegative integer and 
m > l + l' + n + 1. 


Corollary 8.28 In the situation of Lemma \ 8. 23j , suppose that the series 
YffiLi (3j P j andYffkLv PkP k converge to rational numbers u, v!, respectively, in 
the p-adic metric. Then the series YffiLi+v li p l converges to the product uu' 
in the p-adic metric. 
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This is an easy consequence of the lemma. Note that this result does 
not hold for infinite series in the classical sense without some additional 
[llud|| for more information. 

suppose that x is a 


hypotheses. See 


To finish our alternate discussion of Lemma 8.11 


nonzero rational number. For special cases we have given expansions for x 
in terms of sums of products of nonnegative integers with powers of p. One 
can get arbitrary x by taking products, using Corollary fg.28| to ensure that 
this leads to the correct limits. One also employs Lemma |3.15| to obtain an 
expansion in which the coefficients are bounded by p — 1 . 

Let us record some more lemmas. 


Lemma 8.29 Suppose that l is an integer, {Pj}JL l is a sequence of integers 
(of arbitrary sign), and that the sum YffLi Pj P 3 converges in Q in the p-adic 
metric. Then 


(8.30) 


Eft 

j=l 


pj 


< p 


-i 


This is not hard to verify, from the definitions. Of course 


(8.31) 


Eft ft 


< p- m 
P 


for all integers m and n such that l < m < n. 


Lemma 8.32 Suppose that l is an integer, is a sequence of integers 

(of arbitrary sign), and that the sum J2 c f) = iPjP : ’ converges in Q in the p-adic 
metric. If Pi ^ 0 modulo p, then 


(8.33) 


Eft ft 


3=1 



P 


Again, this is not hard to verify. By assumption, the leading term in the 
sum has p-adic absolute value p~ l , while the rest has p-adic absolute value 

< p~ i_1 . 


Lemma 8.34 The expansion described in Lemma 8.11 is unique. In other 
words, suppose that x is a nonzero rational number, l, l 1 are integers, and 
are sequences of nonnegative integers such that ay ^ 0 , 
a[, 7 ^ 0 , 0 < aj < p — 1 for each j > l, 0 < a) < p — 1 for each j > V, and 
the sums YffLijS, YSpLi' <y ' 3 P 3 both converge in Q to x in the p-adic metric. 
Then l = l' and aj = a) for all j > l. 
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Notice first that l = l' under these conditions, since \x\ p = p~ l and \x\ p = 
p~ l , as in Lemma |8.32| . Next, cq = a[, because otherwise the difference 
between the two series has p-adic absolute value p~ l . One can subtract the 
leading terms and repeat the argument to get that ct,- = a' for all j, as 
desired. 


Lemma 8.35 Suppose that l is an integer and that {aq}?L z is a sequence of 
nonnegative integers such that ctj < p — 1 for all j. If YfpLi&jP' converges 
in Q to 0 in the p-adic metric, then ctj = 0 for all j. 


This is easy to check. 

Let us now define Q p , the set of p-adic numbers, to consist of the formal 
sums of the form 

OO 

(8.36) E a iP J ’ 

3=1 


where l is an integer, and each oq is an integer such that 0 < ctj < p — 1 for 
each j > l. If 

OO 


(8.37) 


3=V 


is another such formal sum, with l' < l, say, then the two sums are viewed as 
representing the same element of Q p if a' = 0 when V < j < l and a' = aq 
when j > l. 

If a formal sum Q8.361 ) actually converges to a rational number x with 
respect to the p-adic metric, then let us identify that element of Q„ with 


x 


G Q. The preceding lemmas imply that no more than one element of 


is 


identified in this manned with a single rational number x, and Lemma [S.ll 
says that every rational number is included in Q p through this recipe. Thus 
we can think of Q as a subset of Q p . 

The p-adic absolute value | ■ L can be extended to Q„ by setting 


(8.38) 


E^-p 5 ' =p 

3=1 P 


when cp ^ 0 and 0 < ctj < p — 1, otj G Z for all j (as usual). The zero 
element of Q p , which corresponds to the series with all coefficients equal 
to 0, and to the rational number 0 under the identification discussed in the 
previous paragraph, has p-adic absolute value equal to the number 0. For the 
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elements of Q p which correspond to rational numbers, this definition of the 
p-adic absolute value agrees with the original one, as in the earlier lemmas. 

Next, let us extend the p-adic distance function d p (-, •) to Q p . Suppose 
that 

OO OO 

(8.39) J2 a 3^^ 

3=1 3=1' 

are two elements of Q p , where, as usual, l and V are integers, a 3 is an integer 
such that 0 < atj < p — 1 for all j > l, and a'- is an integer such that 
0 < a'j < p — 1 for all j > V. If either of these two sums is the zero element 
of Q p , so that all of the coefficients are 0, then the p-adic distance between 
the two sums is defined to be the p-adic absolute value of the other sum. If 
they are both the zero element of Q p , then the p-adic distance is equal to 
0. Assume now that both sums correspond to nonzero elements of Q p . In 
this event we ask that cq ^ 0 and a[, ^ 0 , which can always be arranged 
by dropping initial terms with coefficient 0, if necessary. If / ^ l f , then the 
p-adic distance between the two sums is defined to be . Suppose 

instead that l = V. If oq = for all j > l, then the two sums are the same, 
and the p-adic distance between them is defined to be 0. Otherwise, let jo 
be the smallest integer > l such that aq 0 ^ a'- Q . In this case the distance 
between the two sums is defined to be p - - 70 . It is not hard to check that this 
definition of the p-adic distance agrees with the earlier one when both sums 
correspond to rational numbers. 

By definition, the p-adic distance on Q p is nonnegative and equal to 0 
exactly when the two elements of Q p are the same. The distance is also 
symmetric in the two elements of Q p . As before, the p-adic distance satisfies 
the ultrametric condition 


(8.40) 




\ t' \ ' 


for all w±, w 2 , and uq in Q p . This is not difficult to verify. 

In this way Q p becomes a metric space. It is easy to see that Q defines 


a dense subset of Q p , since sums (|8.36|) in which all but finitely many co¬ 


efficients are 0 correspond to rational numbers, and arbitrary sums can be 
approximated by these. One can also show that Q p is complete as a metric 
space, so that it is indeed a completion of Q. 

Like the real line with its standard metric, Q p enjoys the property that 
closed and bounded subsets of it are compact. For this it is enough to know 
that closed balls around the origin are compact. Fix an integer l, and consider 
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the set of w in Q p such that \w\ p < p l . This can be described exactly as 
the set of sums 

OO 

(8.41) 

3=1 

where each oq is an integer such that 0 < oij < p — 1. Now all of the sums 
have the same starting point (at l ), but it is important to allow the initial 
coefficients to be 0, or even all of the coefficients to be 0, to get the right 
subset of Q p . 

Topologically, this set is equivalent to a Cantor set (with p pieces at each 
stage). It is convenient to look at the topology in terms of sequences in the 
set, and convergence of sequences. Namely, a sequence of elements 

of this set converges to another element w of this set exactly if, for each 
j > l, the jth coefficient in the sum associated to the w t ' s is equal to the 
jth coefficient of the sum associated to w for all sufficiently large t. (How 
large t should be is permitted to depend on j.) The statement that the set is 
compact means that for any sequence of elements of the set there is a 

subsequence that converges to an element of the set. This can be established 
through classical arguments. 

Suppose that l is an integer, and that {/?.,■ is a sequence of nonnegative 
integers, without the restriction (3j < p — 1. Consider the formal sum 

OO 

(8.42) 

3=1 

We can view this as still giving rise to an element of Q p , using Lemma |8.15| 
to convert this to a sum a j P 7 where each aj is a nonnegative integer 
such that aj < p — 1. 

This remark permits us to define operations of addition and multiplication 
on Q p . Specifically, one first adds or multiplies two sums in the obvious 
manner, to get a sum in the form ( |8.42|) . One then gets an element of Q p as 
described in the previous paragraph. 

For elements of Q p that correspond to rational numbers, these operations 
of addition and multiplication give the same result as the usual ones. The 
elements of Q p corresponding to the rational numbers 0 and 1 are additive 
and multiplicative identity elements for all of Q p . One also has the usual 
commutative, associative, and distributive laws on Q p . Furthermore, these 
operations define continuous mappings from Q p x Q p into Q p , with respect 
to the p-adic metric. 
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As before, —1 can be written as Y^’jLoip ~ 1 ) p’■ Multiplication by —1 
gives additive inverses in Q p , just as in Q. If re is a nonzero element of Q p , 
then w has a multiplicative inverse in Q p . To see this, it is convenient to find 
a representation for — 1 /w, from which one can get 1/w by multiplication 
by —1. Assume that w is given as J2jLi a jP’i where l is an integer, atj is a 
nonnegative integer such that a.j < p — 1 for all j, and cq ^ 0. Let c be an 
integer such that 1 < c < p — 1 and ccq = — 1 modulo p, and let (3 be the 
nonnegative integer such that c on + 1 = [3 p. Consider the expression 

OO OO 

(8.43) J2 c P~ l (pP + H ca i+iP i y■ 

j =o i =i 


It is easy to expand this out to get a sum of the form 
element of Q p . One can think of ( |S.43|) as corresponding to 


and hence an 


(8.44) 


—1 cp 1 cp 1 

w 1 - (c wp ' - 1) 1- (pp+EZiCa l+i p^)' 


This works, since the product of 


(8.45) 


OO OO 

j2{pp+j2 ca i+tp i y 

3=0 i= 1 


and 

OO OO 

(8.46) (ftp- 1) + J2 ca i+iP l = J2 ca wP % 

i=l i=0 

is equal to 1. 

Thus Q p is a field. With subtraction defined, it makes sense to say that 
d p (w, z) — \w — z\ p on Qp. It can be somewhat more convenient to write this 
as d p (w, w + u) = |it|p for w, u in Q p . Notice too that 


(8.47) 


w + z |p < max(|w|p, \z\ p ) 


and 

(8.48) \wz\p = \w\ p \z\ p 

for w, z in Q p . 
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9 Absolute values on fields 


Let fcbea field. A function | • |* on k is called an absolute value function on 
k, or a choice of absolute values, if |x|* is a nonnegative real number for all 
x in k, |x|* = 0 if and only if x — 0, and 

(9.1) \xy\* = |x|* \y\* 
and 

(9.2) \x + 2 /|* < |x|* + |2/U 

for all x, y in k. 

Notice that 0 y iclds 

(9.3) |1|* = 1, 

where the 1 on the left side is the multiplicative identity element in k, and 
the 1 on the right side is the real number 1. If £ is a nonzero element of k, 
then we obtain that 

(9.4) \x-% = \x\Z\ 
because 1 = |1|* = |xx _1 |* = |x|* lx -1 ]*. Also, 

(9.5) | — 1U = 1, 

since | — l\l = |(—1) 2 |* = |1|* = 1. As a result, | — x|* = |x|* for all x in k. 

For example, the usual absolute values for real numbers defines an abso¬ 
lute value function on R and Q. The usual modulus of a complex number 
defines an absolute value function on C. 

An absolute value function | • |* on k is said to be non-Archimedian , or 
ultrametric, if it satisfies the stronger condition 

(9.6) \x + y\* < max(|x|*, |p|*) 

for all x, y in k, instead of (|9.2| ). The p-adic absolute values on Q or on Q p 
have this property. 

On any field one can define the trivial absolute value function, which is 
equal to 0 at the zero element of the field and to 1 at all nonzero elements 
in the held. Note that this is an ultrametric absolute value function. If k is 
a finite held, then it is not hard to see that the only absolute value function 
on k is the trivial one. 
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If k 0 is a field, let fc 0 (f) denote the field of rational functions in one variable 
t over k 0 . Thus every element of k 0 (t ) can be written as P(t)/Q(t), where 
P{t) and Q{t) are polynomials in t with coefficients in ko, and Q{t) has at 
least one nonzero coefficient. Two such representations 

(9.7) Pi(t)/Qi(t), p 2 (t)/g 2 (t) 

are considered to dehne the same element of ko(t) when 

(9.8) Pi(t)Q 2 (t) = P2(t)Qi(t). 

If P(t) is the zero polynomial, so that all of its coefficients are 0, then 
P(t)/Q(t) is the zero rational function in A; 0 (t) for any Q{t) which is not 
the zero polynomial. 

Fix a real number A with A > 1. We can dehne a nontrivial absolute 
value function on ko(t) by setting it equal to 0 for the zero rational function, 
and to A~ l when the rational function can be expressed as t l Po(t)/Qo(t), 
where Po(t) and Qo(t) are polynomials in t with nonzero constant terms. It 
is easy to see that this satisfies the conditions described before, including 
the ultrametric version of the triangle inequality. If we restrict this absolute 
value function to the subheld of ko(t ) consisting of constant functions, then 
we get the trivial absolute value function on /c 0 - 

Let ko((t)) denote the held of formal Laurent expansions of finite order 
over k 0 in t. In other words, an element of A; 0 ((t)) is given by a formal series 

OO 

(9-9) 

j=n 

where n is an integer and the cq’s are arbitrary elements of k$. If m is an 
integer with m < n and 

OO 

(9.10) 

l=m 

is another such series, then the two series are viewed as defining the same 
element of A; 0 ((f)) if and only if bj = 0 when m < j < n and aj = bj when 
j > n. The series with all coefficients equal to 0 and whatever starting point 
t n correspond to the zero element of /c 0 ((f)). Elements of ko((t)) can be added 
and multiplied in the usual manner, term by term. The series with constant 
term equal to 1 and all others equal to 0 is the multiplicative identity element 



in /c 0 ((£)). If R(t) is a nonzero element of k 0 ((t)), then R(t) can be written 
as 

(9.11) R{t) =a n t n (l-R 0 (t)), 

where n is an integer, a n is a nonzero element of k 0 , and Ro(t) G k 0 ((t )) is 
a given by a sum with only positive powers of t. One can check that the 
standard formula 

OO 

(9.12) (l-i?oW )" 1 = E i? oW i , 

j =o 

with Ro(t)° interpreted as being 1 , makes sense and works in ((£)). Thus 
nonzero elements of ko((t)) have multiplicative inverses, so that k 0 ((t)) is 
a field. There is a natural embedding of k 0 (t) in /c 0 ((£)), by associating 
to a rational function over ko its Laurent expansion around 0. Of course 
polynomials are contained in k 0 ((t)) as finite sums of multiples of nonnegative 
powers of t, and rational functions can be obtained from this using products 
and multiplicative inverses. 

Let A be a real number with A > 1, as before. One can define an absolute 
value function on ko((t)) by taking the absolute value of the zero element of 
k 0 ((t)) to be 0, and the absolute value of ( |9.9| ) to be A~ n when a n 0. 
It is not difficult to check that this defines an absolute value function with 
the ultrametric version of the triangle inequality, and that it agrees with the 
one described earlier for ko(t) when applied to Laurent series coming from 
rational functions. 

Suppose that k is a field and that | • |* is an absolute value function on k 
which satisfies the ultrametric version of the triangle inequality. We shall say 
that | • |* is nice if there is a subset E of the set of nonnegative real numbers 
such that |a; |* G E for all x in k and E has no limit point in the real line 
except possibly at 0. This is equivalent to saying that for any real numbers 
a, b such that 0 < a < b, the set E n [a, b] is finite. 

Lemma 9.13 An ultrametric absolute value function | • |* on afield k is nice 
if and only if there is a real number r such that 0 < r < 1 and |x|* < r for 
all x G k such that |a;|* < 1. 

The “only if” part of this statement is immediate from the definition. 
Conversely, suppose that there is a real number r as in the lemma. If x, y 
are elements of k such that |x|* < |y|*, then | xy^ 1 ]* < 1 , so that \xy ~ l \* < r 
and |a;|* < r |t/|*. One can use this to show that the absolute value function 
takes values in a set E as above. 
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Lemma 9.14 An ultrametric absolute value function \ ■ |* on a field k is 
nice if there is a positive real number s < 1 and a finite collection x±,... ,x m 
of elements of k such that for each y in k with y|* < 1 there is an Xj, 
1 < j < m, such that \y — xfi * < s. 


To prove this, suppose that the hypothesis of Lemma |9.11| holds, and let 
us check that the condition in Lemma |9. 13 is satisfied. If 1 < j < rn and 


y e k satisfy \y\, 
(9.15) 


< 1 and | y — xf. M < s, then 

\xj\* < maxdxj — 2 /|*, |y|*) < 1. 


We may as well assume that {xfi* < 1 for all j , since otherwise we can reduce 
to a smaller collection of xfis with the same property as in the lemma. Take 
r to be the maximum of s and the numbers |xj|*, 1 < j < m, so that r < 1 . 
If y is an element of k and |y|* < 1, then \y — Xj\* < s for some j, and hence 
|y|* < max(|y — Xj\*, |:c,-|*) < r, as desired. 


10 Norms on vector spaces 

Fix a field k and an absolute value function | • |* on k, and let V be a vector 
space over k. A norm on V with respect to this choice of absolute value 
function on k is a real-valued function N(-) on V such that the following 
three properties are satisfied: (a) N(v) > 0 for all v in V, with N(v) = 0 
if and only if v — 0; (b) N(av) = |a|* N(v) for all a in k and v in V\ (c) 
N{y + w) < N{v) + N(w) for all u, w in V. 

In this section we make the standing assumption that | • |* is an ultrametric 
absolute value function on k, and we shall restrict our attention to norms N 
on vector spaces V over k with respect to | • |* that are ultrametric norms, in 
the sense that 

(10.1) N(v + w) < ma x(N(v),N(w)) 

for all v, w in V. Observe that if N(-) is an ultrametric norm on V, then 
d(v, w ) = N(v — w) is an ultrametric on V, so that 

( 10 . 2 ) d(u,w ) < ma x(d(u,v),d(v,w)) 
for all u, v, and w in V. 

One can think of k as a 1 -dimensional vector space over itself, and then 
the absolute value function | ■ |* defines an ultrametric norm on this vector 
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space. If n is a positive integer, then k n , the space of n-tuplcs of elements 
of k, is an n -dimensional vector space over k, with respect to coordinatewise 
addition and scalar multiplication. Consider the expression 

(10.3) max lx,-1* 

1 <j<n J 

for each x = (xi,...,x n ) in k n . It is easy to check that this defines an 
ultrametric norm on k n . 

Lemma 10.4 Let V be a vector space over k, and let N be an ultrametric 
norm on V. Suppose that v, w are elements of V, and that N(v) ^ N(w). 
Then N(v + w) — max(N(v), N(w)). 

We have that N(v + w) < max(iV(u), N(w)) for all v, w in V, and so we 
want to show that the reverse inequality holds when N(v) ^ N(w). Assume 
for the sake of definiteness that N(v) < N(w). Notice that 

(10.5) N(w) — N((v + w) + (— v)) < ma x(N(v + w), N(—v)) 

= ma x(N(v + w), N(v)). 

Because N(v) < N(w), this implies that N(w) < N(v + w). Hence 

(10.6) max(AT(i;), N(w)) < N(v + w), 
as desired. 


Corollary 10.7 Let V be a vector space over k, and let N be a ultrametric 
norm on V. Suppose that m is a positive integer, and that v\,... ,v m are 
elements ofV such that N(vj ) = N(vi) only when either j = l orvj = = 0. 

Then 

m 

(10.8) n(J2 Vj) = N( Vj ). 


This can be derived from Lemma |10.4| using induction. 


Lemma 10.9 Suppose that V is a vector space over k of dimension n, and 
that N is an ultrametric norm on V. Let E be a subset of the set of nonnega¬ 
tive real numbers such that |x|* e E for all x in k. Ifv i,..., v n+ \ are nonzero 
elements ofV, then at least one of the ratios N{vj)/N(v{), 1 < j < l < n + 1, 
lies in E. 
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Let Vi,... ,v n+ i be nonzero vectors in V. If the ratios N(vj)/N(vi ) do 
not lie in E for any j, l with j ^ l, then one can check that V \,..., u n+1 
are linearly independent in V, using Corollary 110.7] . This contradicts the 
assumption that V has dimension n, and the lemma follows. 

In analogy with the definition of a nice ultrametric absolute value function 
in Section let us say that an ultrametric norm N on a vector space V over 
k is nice if there is a subset E\ of the set of nonnegative real numbers such 
that N(v) lies in E\ for every vector v in V and E\ has no limit point in 
the real line except possibly for 0. The latter is equivalent to asking that 
E\ n [a, b\ be finite for every pair a , b of positive real numbers. 

Corollary 10.10 Let V be a vector space over k, and let N be an ultrametric 
norm on V. If the absolute value function \ • |* on k is nice, then N is a nice 
ultrametric norm on V. 


More precisely, if V has dimension n, and if If is a subset of the set of 
nonnegative real numbers such that the absolute value function | ■ |* on k 
takes values in E, then there are positive real numbers a 1; ..., a n such that 
N takes values in the set E\ = (J"=i % E. Here aE = {as : s £ E}. This 
can be derived from Lemma |10.9| . (Note that the af s need not be distinct.) 

Fix a positive integer n, and let us take our vector space to be k n . We 
shall say that a norm N on k n is nondegenerate if there is a positive real 
number c such that 


( 10 . 11 ) 


c max lx,-L < Nix) 

1 <j<n J 


for all x = (xi,...,x n ) in k n . This condition is automatically satisfied if 
{y G k : |y|* < 1} is a compact subset of k, using the metric \u — u|* on k. To 
see this, observe that it is enough to check that N has a positive lower bound 
on the set of x in k n such that maxi<j< n \xf* = 1 , because of homogeneity. 
By assumption, this is a compact subset of k n in the product topology, and 
N is positive at each element of this set since N is a norm. One can verify 
that N is continuous, and in fact N is locally constant away from 0 in k n . 


Remark 10.12 For any norm N on k n there is a positive real number C so 


that 

(10.13) 


Nix) < C max lx,- 

1 <j<n J 


for all x in k n . This is easy to see, by writing x as a linear combination of 
standard basis vectors, and it applies to norms in general, whether or not 
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they are ultrametric norms. For that matter, the notion of nondegeneracy 
makes sense for norms in general. 


Lemma 10.14 Suppose that the absolute value function \ ■ |* on k is nice. 
Let N be a nondegenerate ultrametric norm on k n , let W be a vector sub space 
of k n , and let z be an element of k n which does not lie in W. There exists 
an element xq ofW such that the distance N(z — xq) is as small as possible. 


Notice first that there is a 5 > 0 such that N(z — x) > 5 for all x G 
W. This uses the fact that W is a closed subset of k n with respect to the 
product topology. For instance, one can describe W as the set of vectors 
where finitely many linear functions vanish, and these linear functions are 
continuous. Because | • |* is nice, N is nice, as in Corollary |10.10| , and hence 
the inhmum of N(z — x) over x in W is attained, since it is positive. 


Remark 10.15 If w G W satisfies N(w) < N(z—x 0 ), then N(z—(x 0 +w)) < 
N(z — xq), so that N(z — (xo + w)) = N(z — xq) if N(z — xq) is as small as 
possible. Conversely, if N(z — (rro + tc)) = N(z — x o), then N(w) < N(z — x o). 


Lemma 10.16 Assume that \ ■ |* is a nice absolute value function on k. Let 
N be a nondegenerate ultrametric norm on k n , let V\ be a vector space over 
k, and let N\ be an ultrametric norm on V\. Suppose that W is a vector 
subspace of k n , and that T is a linear mapping from W to V\. Assume also 
that A is a nonnegative real number such that 

(10.17) 7Vi(T(u)) < AN{v) 

for all v in W. Then there is a linear mapping T\ from k n to V± such that 
Ti(v) = T(v) when v lies in W, and 

(10.18) Ni(Ti(v)) < AN(v) 
for all v in k n . 


To prove this, let Vi, W, T, etc., be given as above. We may as well 
assume that W is not all of k n , since otherwise there is nothing to do. Let z 
be an element of k n which does not lie in W, and set W\ = span (IF, z). We 
would like to extend T first to W\. 
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Let x 0 be an element of W such that N(z — x 0 ) is as small as possible, as 
in Lemma |10.14| , and set y 0 = T(x 0 ). Dehne T x on W\ by 

(10.19) Ti{y + a z) = T(v) + ay 0 

for all v in W and a in k. With this definition 7\ is clearly linear on W\ and 
agrees with T on W. We want to check that 


( 10 . 20 ) 


Ni(Ti(v + a z)) < A N(v + az) 


for all v in W and a in k. This inequality holds when a = 0 by assumption, 
and so we may restrict ourselves to a ^ 0. By the homogeneity of the norms, 
we are reduced to showing that 

(10.21) NiiTiiv + z)) < A N(v + z) 
for all v in W, which is the same as 

(10.22) N 1 (T(v) + y 0 )<AN(v + z). 

We can rewrite this further as 

(10.23) Ni(T(v + x 0 )) < AN(v + z), 

by the definition of y 0 . 

Since 

(10.24) Ni(T(v + x 0 )) < A N(v + x 0 ) 

by hypothesis (because x 0 lies in W) , it suffices to show that 

(10.25) N(v + x 0 ) < N(v + z) 


for all v in W. Notice that 


( 10 . 26 ) 


N(v + x 0 ) = 
< 


N((v + z) + (—z + x 0 )) 
max(iV(u + z),N(—z + x 0 )) 
max(A^(r; + z), N(z — xq)). 


By the manner in which xq was chosen, N(v + z) > N(z — xq). This yields 
(fTTL25|) . 


Thus T can be extended to a linear mapping T\ from the larger sub¬ 
space W\ into V\ with the inequality ( |10.18| ) holding there. By repeating the 
process, we can get an extension to all of k n . This proves Lemma 10.161 . 
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Corollary 10.27 Assume that \ ■ |* is a nice absolute value function on k. 
Let N be a nondegenerate ultrametric norm on k n , and let W be a vector 
subspace of k n . There is a linear mapping P : k n —> W which is a projection, 
so that P(w) = w when w E W and P{y ) lies in W for all v in k n , and 
which satisfies 

(10.28) N(P(v)) < N(v) 
for all v in k n . 

This follows by defining P first on W to be the identity mapping, and 
then extending this to a mapping from k n to W using Lemma |10.16 . 

Suppose that W and P are as in the corollary, and let W 0 denote the 
kernel of P. We may as well assume that W is neither the zero subspace 
of k n nor all of k n , so that the same is true of W 0 . As usual, if / denotes 
the identity mapping on k n , then I — P is a projection of k n onto W 0 , be., 
(/ — P)(v) lies in Wq for all v in k n and (/ — P)(v) = v when v lies in Wo. 
Here we also have that 

(10.29) N((I-P)(v)) < N(v) 
for all v in k n , because 

(10.30) N((I-P)(v)) = N(v-P(v)) 

< ma x(N(v), N(P(v))) = N(v). 

We can go further and say that 

(10.31) N(v) = max(N(P(v)), N((I - P)(v))) 
for all v in k n . That is, 

(10.32) N(y) = N(P(v) + (I - P)(v)) 

< max(N(P(v)),N((I-P)(y))), 

while the reverse inequality follows from the ones that have already been 
derived. 

Let us pause a moment for some terminology. Suppose that V is a vector 
space over k of dimension n, and that xi,...,x n is a basis for V. This 
determines a dual family /),..., f n of linear functionals on V. i.e., each fj is 
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a linear mapping from V into k such that fj{xf) = 1 and fj(x{) = 0 when 
3 ~f~ l - 

Now assume that V is equipped with an ultrametric norm N. We say 
that the basis xi,... ,x n is normalized if 

(10.33) N( Xj ) ■ \fj(v)\, < N(v) 

for all vectors v in V. Note that equality occurs when v — x r 

Lemma 10.34 Assume that \ ■ |* is a nice absolute value function on k. If N 
is a nondegenerate ultrametric norm on k n , then there is a normalized basis 
for k n with respect to N. 

When n = 1 there is nothing to do. For n > 1 one can use induction, 
with the projection operators discussed above permitting one to go from n 
to n + 1. 


Lemma 10.35 Let V be a vector space over k with dimension n, equipped 
with an ultrametric norm N, and let x x ,..., x n be a normalized basis for V, 
with dual linear functionals fi ,..., f n . Then 

(10.36) N(v) = ma x{N( X j) ■ |/j(i>)|* : 1 < j < n} 

for all vectors v in V. 


The normalization condition ( |10.33|) gives one inequality in (|10.36|) . For 
the opposite inequality, we write 


(10.37) v = fi(v) Xi H-h f n {v) x n: 


so that 

(10.38) N(y) < ma x{N(fj(v) Xj) : 1 < j < n}. 


Remark 10.39 If Xi,... ,x n is a normalized basis for V, then we can mul¬ 
tiply the x,j ; s by arbitrary nonzero elements of k and get a new normalized 
basis for V. In particular, if N has the feature that it takes values in the set 
of nonnegative real numbers which occur as values of | • |*, then we can mul¬ 
tiply the Xj’s by elements of k to get a basis yi,... ,y n which is normalized 
and satisfies N(yj) = 1 for each j. If hi,..., h n is the corresponding dual 
basis of linear functionals on V, then (|10.36|) becomes 


(10.40) 


N(v) = niax{|/ij(t’)|* : 1 < j < n} 


for all v in V. 
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Let G be a finite group, and p be a representation of G on k n . Let N 0 be 
any fixed ultrametric norm on k n . Define N(v) on V by 

(10.41) N(v) = max{IVo(p 0 (u)) : a G G}. 

It is not difficult to check that this defines an ultrametric norm on k n . In¬ 
deed, each N(p a (v)) defines an ultrametric norm on k n , and passing to the 
maximum yields an ultrametric norm as well. By construction, this norm 
has the property that 

(10.42) ^ N(p b (v)) = N(v) 

for all b in the group G and all vectors v in V. If we choose N 0 so that it is 
nondegenerate, such as N 0 (x ) = maxi <j< n Xj |*, then N is also nondegener¬ 
ate. Similarly, if we choose Nq so that it takes values in the set of nonnegative 
real numbers which occur as values of the absolute value function | • |* on k, 
as with N 0 (x) = maxi<j< n then N has the same feature. 

11 Operator norms 

Let k be a field with an absolute value function | • |*. Fix a positive integer 
n, and consider the vector space k n of n-tuples of elements of k. Let N(-) be 
a norm on k n with respect to | • |*, which we assume to be nondegenerate, in 
the sense of (|10.11|) . 

If T is a linear operator from k n to itself, then the operator norm ||T|| op 
of T with respect to N can be defined by 

(11.1) ||T|| op = sup {N(T(x)) : x G k n , N(x) = 1}. 

This supremum is finite because of the nondegeneracy condition for N. One 
can reformulate this definition as saying that 

(11.2) JV(T(x))<||T|U,lV(x) 

for all x in k n , and that ||T|| op is the smallest nonnegative real number with 
this property. 

The collection of linear operators from k n to itself forms a vector space 
over k in the usual manner, and ||T|| op defines a norm on this vector space, 
i.e., ||T||op is a nonnegative real number which is equal to 0 exactly when T 
is the zero linear operator on k n : 

(11-3) \\aT\\ op = \a\*\\T\\ op 


77 





for all a in k and linear operators T on k n , and 


(11-4) ||Xi + T 2 || op < ||Ti|| op +||T 2 |U 

for all linear operators T\ , T 2 on k n . These properties follow from the corre¬ 
sponding features of the norm N. Notice that 

in-5) imu = i, 

where I denotes the identity operator on k n , and that 

(11-6) ||^i ° T 2 1| op < ||Ti || T 2 \\op 

for any two linear operators T), T 2 . If | • |* is an ultrametric absolute value 
function and N(-) is an ultrametric norm, then the operator norm || • \\ op is 
also an ultrametric norm, so that 

(11-7) ||Ti + T 2 || op < max(||Ti|| op , ||T 2 || op ) 

for all linear operators T\ , T 2 on k n . 

Suppose that T, A are linear operators on k n , T is invertible, and 

(11-8) \\A\\o P <\\T- 1 \\^. 

In this case T + A is also an invertible linear operator on k n . To see this, it 
suffices to show that the kernel of T + A is trivial. Let x be any element of 
k n . If y = T(x), then 

(H-9) N(T-\y)) < WT^WopNiy), 

which is the same as saying that 

(11.10) l|r- , ||- 1 JV(n:) < JV(r(a:)). 

We also have that 

(11.11) N(T(x)) = N((T + A)(x) - A{x)) 

< N((T + A)(x)) + N(A(x)) 

< N((T + A)(x)) + \\A\\ op N(x), 

QiXid hence 

(11.12) (HT-'lli 1 - H^IU) JV(rc) < JV((T+/l)(i)). 
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Thus (T + /L)(x) 7 ^ 0 when x ^ 0, so that T + A is invertible, and we get the 
norm estimate 


(11.13) \\(T + A)-% p < (\\T~^ - \\A\\ op )-\ 


If 


is an ultrametric absolute value function and N is an ultrametric 


norm, then (| 11 . 11 |) can be replaced with 


(11.14) N(T(x)) < max(N((T + A)(x)),N(A(x))) 

< max(7V((T + A)(x)), || A\\ op N(x)), 


so that 

(11.15) IIT-'H - 1 N(x) < max(JV((T + A){x)), P|UM(x)) 
for all x in k n . Because ||A|| op < ||T~ 1 ||^ p 1 , we obtain that 

(11.16) \\T- 1 \\^N(x)<N((T + A)(x)). 


More precisely, one derives this initially for i ^ 0, and then notes that it 
holds automatically for x = 0 , so that the inequality applies to all x in k n . 
This leads to 


(11.17) ||CT + .1)- 1 |U< 


m 


the ultrametric case (under the condition (| 11 . 8 | )). 


Definition 11.18 Let B be an algebra of operators on k n which is a division 
algebra (Definition \5.1\ ). We say that B is a uniform division algebra if there 
is a positive real number C so that 

(11.19) IIT-'ll^scurii- 1 

for all nonzero operators T in B. 


Remark 11.20 The opposite inequality is automatic, i.e., 

(ii.2i) i < lir-'iumu,, 

since T ” 1 o T — I has norm 1. 
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This property does not depend on the choice of a nondegenerate norm N 
on k n , although the choice of N can affect the constant in the definition. If 
k is locally compact, so that {a E k : |a|* < 1 } is a compact set with respect 
to the metric |a — j3\* on k, then any algebra of operators on k n which is 
a division algebra is in fact a uniform division algebra. Indeed, it suffices 
to establish (|11.19|) for the T’s such that ||T|| op = 1, and this is a compact 
subset of the vector space of linear operators on k n under our assumption. 
It is not hard to use compactness to get a uniform bound for ||T _1 | 
desired. 


op ? 


as 


For the rest of this section, let us assume that 


is an ultrametric 


absolute value function on k, and consider the ultrametric norm 


(11.22) Ni(x) = max |x,j* 

l<j'<n 

on k n . 

Let T be a linear operator on k n . Thus T can be described by an n x n 
matrix (ajy) with entries in k, so that 

n 

(11.23) (T(x))j = J2 a j,iXi 

i=i 

for all x in k n , where (T(x))j denotes the jth component of T(x), just as Xi 
denotes the Zth component of x. The operator norm ||Tj| op of T with respect 
to N\ can be given by 

(11.24) \\T\lop = max \a j:l |*. 

1<J,l<n 

To check that ||T|| op is less than or equal to the right side of the equation, 
one can use the ultrametric property of | ■ |*. For the opposite inequality, one 
can look at T(x) in the special case where x has one component equal to 1 
and the rest equal to 0 . 

When is T an isometry, i.e., when does T satisfy 

(11.25) N^Tix)) = N^x) 

for all x in k n 7 This is clearly equivalent to asking that ||Xj| op < 1 and 
HT” 1 ^ < 1. Let us check that this happens if and only if ||Xj| op < 1 and 
| detT|* = 1, where detT denotes the determinant of T (or, equivalently, of 
the matrix (a^y)). Notice first that if ||Tj| op < 1, then |detXj* < 1. This 
uses merely the definition of the determinant, together with the fact that 
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the entries of the matrix associated to T have absolute value less than or 
equal to 1, and the properties of the absolute value function. Similarly, the 
determinant of any square submatrix of T has absolute value in k less than 
or equal to 1. By Cramer’s rule, T ” 1 is given by (detT )” 1 times the cofactor 
transpose of T. The matrix entries for the cofactor transpose have absolute 
value less than or equal to 1 when ||T|| op < 1, since they are given in terms 
of determinants of submatrices of the matrix of T. Thus the operator norm 
of the cofactor transpose is less than or equal to 1 when ||T|| op < 1, because 
of the formula ( |11.24|) applied to the cofactor transpose (instead of T). If 
| detT|* = 1, then it follows that ||T _1 || op < 1 . Conversely, if ||r - 1 1| 0:p < 1, 
then | detT” 1 1* < 1, and hence | detT|* > 1, since detT ” 1 = (detT)” 1 . This 
implies that | det T|* = 1 if ||T|| op < 1 too. 

As in Section 0 - one way that isometries on k n come up is through 
representations of finite groups on k n . That is, we might start with the norm 
Ni, not necessarily invariant under the group representation, and then obtain 
an invariant norm using ( 10.41|) . The new norm can be converted back into 
Ni after a linear change of variables on k n , if the absolute value function 
| • |* is nice, as in Section |10|. In other words, the representation of the finite 
group can be conjugated by an invertible linear transformation on k n to get 
a representation that acts by isometries with respect to N x . Compare with 
Appendix 1 of Chapter IV in Part II of ||Serl . 

In addition to norms for operators on a single vector space, one can 
consider norms for operators between vector spaces. Suppose that Vj, V 2 
are vector spaces over the held k, and that Ni (•), N 2 (-) are norms on Vi, V 2 
with respect to the absolute value function | - |* on k. We assume that N x is 
nondegenerate, as before, with respect to a linear equivalence between Vi and 
k n , where n is the dimension of V x . It is easy to check that the condition of 
nondegeneracy does not depend on the choice of linear isomorphism between 
Vi and k n . 

If T is a linear mapping from V\ to V 2 , then the operator norm ||T|| OPi i 2 
of T with respect to N x , N 2 on Vi, V 2 is defined by 


(11.26) 


||T || op , 12 = sup{lV 2 (T(a;)) : x G Vi, N x {x) = 1}. 


The supremum makes sense because of the assumption of nondegeneracy for 
N x . As before, the operator norm is characterized by the conditions that 


(11.27) 


N 2 (T(x)) < ||T || oP)12 N x (x) 
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for all x in V\ and that ||T || 03 , )12 be the smallest nonnegative real number for 
which this property holds. 

The operator norm defines a norm on the vector space of linear mappings 
from V\ to V 2 , as one can easily verify. If j • |* is an ultrametric absolute value 
function on k, and N\, N 2 are ultrametric norms on V\, V 2 , then || • || 0 p,i 2 is 
an ultrametric norm on the vector space of linear mappings from V\ to V 2 . 

Suppose that V 3 is another vector space over k, and that N 3 is a norm on 
V 3 with respect to the absolute value function | • J* on k. Assume that the 
norm N 2 on V 2 is also nondegenerate in the sense described before. We can 
define operator norms || • || op ,i 3 and || • || op ,23 for linear mappings from V\ to 
V 3 and from V 2 to V 3 , respectively, using the norms , N 2 , N 3 on V \, V 2 , 
V 3 . If T is a linear mapping from V\ to V 2 and S' is a linear mapping from 
V 2 to V 3 , so that the composition S o T defines a linear mapping from V\ to 
V 3 , then 

(H-28) \\SoT\\ oPtl3 <\\S\\ op ,2 3 \\T\\ oPtl 2. 

This is easy to check. 

12 Vector spaces of linear mappings 

Let k be a field, and let V, W be vector spaces over k. Recall that C(V, W ) 
denotes the vector space of linear mappings from V to W. For notational 
simplicity, we shall write H for £(V, W) in this section. 

Suppose that A is an algebra of operators on V. We can associate to A 
an algebra of operators Ah,i o n H, where a linear operator on H lies in Ah. 1 
if it is of the form R^RoT,R^H, where T lies in A. Similarly, if B is 
an algebra of linear operators on W, then we can associate to it an algebra 
Bh ,2 consisting of operators on H of the form R 1 —> S o R, R 6 H, where S 
lies in B. In other words, we are using composition of operators in H with 
operators on V or on W to define linear operators on H. 

Remark 12.1 As a basic case, let A be C{V) and let B be all of C(W). It 
is easy to check that every element of Aha commutes with every element of 
Bh, 2 - br fact, 

( 12 . 2 ) (A/yi ) 7 = Bh, 2 ? {Bh, 2 )' = Ah, 1 , 

where the prime refers to the commutant of the algebra as a subalgebra of 
£{H). This is not difficult to show. 
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Remark 12.3 Let B be any algebra of operators on W. The algebra of 
operators Bh ,2 on H is essentially the same as “expanding” B as in Subsection 
b . 6| . with n equal to the dimension of V. In this way, Proposition |2.6| can be 
reformulated as saying that (B")h, 2 = (Bh, 2 )''■ The description of (Bh^Y in 
the proof of the proposition can be rephrased as saying that (Bh 2 )' contains 
(B'W and (£(V))h,i, and is generated by them. 

Remark 12.4 Let G 1 , G 2 be finite groups, and let cr, r be representations 
of G 1 , G 2 on V, W, respectively. We can define representations a, f of Gi, 
G 2 on H by 

(12.5) &x(R) = R o (PxY 1 , fy(R) = r y oR 

for a: G G 1 , y G G 2 , and R G H. If A is the algebra of operators on V 
generated by cr, then Ah,i is the same as the algebra of operators on H 
generated by a. Similarly, if B is the algebra of operators on W generated 
by r, then Bh,2 is the same as the algebra of operators on H generated by f. 

Remark 12.6 Suppose that k is a symmetric field, and that V, W are 
equipped with inner products (-, -)y, (■, -)wi as i 11 Subsection For every 
linear mapping R : V —r W there is an “adjoint” R*, a linear mapping from 
W to V, characterized by the condition 

(12.7) (R(v),w) w = ( v , R*(w)) v 

for all v G V, w G W. Note that if T is a linear operator on V, and if 5 is a 
linear operator on W, then R o T, S o R are linear mappings from V to W, 
and 

(12.8) (R o T)* = T* o R*, (S o R)* = R* o S*. 

Here T* is the adjoint of T as an operator on V, S* is the adjoint of S as an 
operator on W, and R*, (R o T)*, and (S o R)* are the adjoints of R, R o T, 
and S o R as operators from V to W. (In general, if one has three inner 
product spaces, a linear mapping from the first to the second, and a linear 
mapping from the second to the third, then the adjoint of the composition 
is equal to the composition of the adjoints, in the opposite order.) 

We can define an inner product (■, -)h on H by 

(12.9) (Rh R 2 ) H = tlyRy o R\ = tlwRl 0 R 21 

where try T denotes the trace of a linear operator T on V and tr W / S denotes 
the trace of a linear operator S on W. Recall that if U\ is any linear mapping 
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from V to W, and U 2 is any linear mapping from W to V, so that U 2 is a 
linear operator on V and U\ o U 2 is a linear operator on W, then try U 2 olJ\ = 
tr w UxoU 2 . This is a standard property of the trace, and it implies the second 
equality in (|12.9|) . To see that {Ri,R 2 )h satisfies the positivity condition 
required of an inner product, one can compute tiyR* °R using an orthogonal 
basis of V, and reduce to the positivity condition for the inner product on 
W, since 

(12.10) ((R* o R)(v),v) v = (R(v), R(v))w 


for all v in V. Of course one could instead use an orthogonal basis for W 
and reduce to the positivity condition for the inner product on W. 

Another way to describe the inner product on H is to observe that if R{, 
R 2 are rank-1 operators given by 


(12.11) R^v) = {v,vi) v w u R 2 (v) = (v,v 2 ) v w 2 , 

where iq, v 2 are elements of V and wq, w 2 are elements of W, then R 2 {w) = 
(w,w 2 ) w v 2 and 

(12.12) (Ri, R 2 )h = {v 2 , v\)v {w 1 , w 2 )w- 

If Vi, , v n is an orthogonal basis for V, and Wi, ..., w m is an orthogonal 
basis for W, then 


(12.13) (• ,Vj)yWi , 1 < j < n, 1 < l < m 

is an orthogonal basis for H. 

If T is a linear operator on V, then R t—> R o T dehnes a linear operator 
on H. One can verify that the adjoint of this linear operator, with respect 
to the inner product just defined on H , is given by R ^ R o T*. Similarly, 
if S is a linear operator on W, then R t—> S o R dehnes a linear operator on 
H , and the adjoint of this operator is given by R 1 —> S* o R. 

Thus, if A is a *-algebra of operators on V, then Ah, 1 is a ^-algebra 
of operators on H , and if B is a *-algebra of operators on W, then Bh ,2 is 
a ^-algebra of operators on H. As in Remark |12.3| , this also came up in 
Subsection [2~fj . 


Let A, B be algebras of operators on V, W, respectively. The combined 
algebra of operators C on H is defined to be the algebra generated by Ah,i 
and Bh, 2 - Thus the elements of C are the operators on H which can be 
written as 

(12.14) A\ B\ + A 2 B 2 + • • • + A r B r , 
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where each Aj lies in Aha and each Bj lies in Bh, 2 - Because the elements of 
A H ,i and Bh .2 commute with each other, one does not need more complicated 
products in (|12.14|). 


Remark 12.15 If A = C(V ) and B = C(W ), then the combined algebra C 
is all of C(H). This is not hard to check. 


Remark 12.16 Suppose that A: is a symmetric held and that V and W are 
equipped with inner products, as in Remark |12.6| . If A and B are ^-algebras 
of operators on V and W, then the combined algebra C is a ^-algebra on H 
(using the inner product on H defined in Remark |12.6|) . 


Remark 12.17 Suppose that G i, G 2 are finite groups, and that cr, r are 
representations of G 1 , G 2 on V, W, respectively. Consider the product group 
G\ x G 2l in which the group operation is defined componentwise, using the 
group operations on G 1 , G 2 . Let p be the representation on H obtained from 
a, r by setting 

(12.18) P(x,y){R) =r y oRo (cq,) —1 

for all ( x , y ) in G\ x G 2 and R in H. If A is the algebra of operators on 
V generated by cr, and B is the algebra of operators on W generated by r, 
then the combined algebra C is the same as the algebra of operators on H 
generated by the representation p of Gi x G 2 . 


Lemma 12.19 If A, B are algebras of operators on V, W with dimensions 
r, s, respectively, as vector spaces over k, then the combined algebra C has 
dimension rs. 


To see this, suppose that Ai,... ,A r is a basis for A, and that Bi,... ,B S 
is a basis for B. Consider the operators 

(12.20) R 1 —■> Bj o R o Ai, 1 < j < s, 1 < l < r, 

on H. It is easy to check that the combined algebra C is spanned by these 
rs operators, and we would like to show that they are linearly independent. 

Let Cj,i, 1 < j < s, 1 < l < r, be an family of elements of k such that the 
operator 

s r 

( 12 . 21 ) R^Y,J2 c B B j° RoA i 

j= 1 1=1 
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on H is equal to 0, i.e., 


( 12 . 22 ) 


55 C U BjoRo A , = 
j=i i=i 


0 


for all R in H. If w is any element of W and / is any linear mapping from 
V to k, then R(v) = f(y)w is a linear mapping from V to W, and ( 12.22|) 
can be rewritten for this choice of R as 


(12.23) 


s r 


555 2 c j,i b j{ w ) f(M v )) 

3=1 1=1 


0 


for all v in V. Let us rewrite this again as 

(12-24) ±(±c j ,f(Mv)))B j (w) = 0. 

3=1 1=1 

This holds for all w in W, v in V, and linear mappings / from V to k. For 
any fixed v and /, the linear independence of the B/'s as operators on W 
leads to 

(12-25) 't l Ci, l f(Mv)) = 0 

i=i 

for each j. The linear independence of the Ai s now implies that c^j = 0 for 
all j and l, which is what we wanted. 


13 Division algebras of operators, 2 

Let k be a field, let V be a vector space, and let A be an algebra of operators 
on V which is a division algebra. Put 

(13.1) T = AnA', 

so that T is the center of A, i.e., the collection of elements of A which 
commute with all other elements of A. Thus T is an algebra of operators on 
V which is commutative and a division algebra of operators. 

It will be convenient to write Y for A viewed as a vector space over k, 
for the purpose of considering linear operators on Y. Define A\ to be the 
algebra of linear operators on Y of the form 

(13.2) R^RoT, T e A, 
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and define A 2 to be the algebra of linear operators on Y of the form 

(13.3) R^ToR, T <E A. 

When T lies in T, R o T = T o R for all R in Y, and we write To for the 
algebra of linear operators on Y of the form 

(13.4) R^RoT = ToR, TeV. 

Lemma 13.5 (a) A\, A 2 are division algebras of operators. 

(b) Every element of A\ commutes with every element of A 2 - 

(c) Ai, A 2 are irreducible algebras of operators on Y. 

These properties are easy to verify, just from the definitions and the 
assumption that A is a division algebra of operators. Concerning (c), it can 
be helpful to rephrase the question as follows: if R , R are elements of Y and 
R 7 ^ 0, then there are elements of A \, A 2 which take R to R. 

Lemma 13.6 T 0 = A\ fl .A 2 . 


Indeed, suppose that S , T are elements of A such that the operators 
(13.7) R^RoS, R^ToR 

on Y are the same. We can take R to be the identity operator on V to obtain 
that S = T, and then our hypothesis becomes RoT -To R for all R in A. 
This says exactly that T lies in T, as desired. 

Lemma 13.8 ( A\)' = A 2 and (A 2 )' = A\, where the primes indicate that 
we take the commutant of the given algebra as an algebra of operators on Y. 


The fact that A 2 Q (Al)' and Ai C (A 2 )' is the same as part (b) of 
Lemma |13.5| . Conversely, suppose that <h is a linear transformation on Y 
which commutes with every element of A\. This is the same as saying that 


(13.9) 


<5>(R°S) = <5>(R)oS 


for all R , S in Y, i.e., in A. Applying this to R equal to the identity operator 
/ on V we get that 

(13.10) $(5) = $(/) o S 

for all S in Y. This says exactly that $ lies in *4. 2 , so that (^4i)' C .4. 2 . 
Similarly, (^4 2 )' C Ai, and the lemma follows. 

Let „4 .i 2 be the algebra of operators on Y which is generated by Ai, A 2 . 
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Lemma 13.11 r 0 is the center of An- 


This is easy to check. 


Proposition 13.12 Au = Tq. 

To prove this we use Lemma E3 where now Y has the role that V had 
before, An l ias the role that *4 had before, and T 0 has the role that B had 
before. Of course An Q Tq, which was a standing assumption for Lemma 
|5.18| , mentioned just before the statement of Lemma |5.16| . According to |5.18| , 
it suffices to show that An is irreducible, and that for any elements R\, R 2 , 
Ui, U 2 of Y such that R,\ , R 2 are T 0 -independent there is a $ in An such 
that = U\ and &(R 2 ) = U 2 . (The notion of independence employed 

here was defined near the beginning of Section |5|.) The irreducibility of An 
is a consequence of the irreducibility of either A\ or A 2 , and so it remains to 
verify the second condition. 

For the second condition, it is enough to show that if R, U \, U 2 are 
elements of Y such that R is not an element of T C A, then there is a T in 
An such that T(/) = U±, 'h(A) = U 2 . Here / denotes the identity operator 
on V, as an element of Y = A, just as T can be viewed as a subspace of 
Y. In other words, if Ri, R 2 are elements of Y which are To-independent, 
then they can be mapped to I, Rf 1 R 2 by an element of .A 2 C An-, and the 
T 0 -independence of Ri, R 2 says exactly that Rf 1 R 2 does not lie in T. 

In fact it is enough to show that for every R, U in Y such that R is not 
in T there is a T in ^4 12 which satisfies T(J) = 0 and 'I'(-R) = U. The reason 
for this is that for any U\ in Y there is an element of A\ (or A 2 ) which sends 
/ to U\ (corresponding to composition with U\). To prescribe a value also at 
R, one can use a T in .4 . 12 as in the new version of the condition. 

Thus we let R , U in Y be given, with R not in T. Because R is not in 
T, there is an 5 in Y = A such that RoS — SoR^O. Thus the inverse of 
R o S — S o R exists and lies in A. Let T be the linear operator on Y defined 
by 

(13.13) T(A) = Uo(RoS-So R)- 1 o(AoS-SoA), 

A e Y — A. It is easy to see that T lies in An, be., it can be written as 
a sum of compositions of operators on Y in Ai, A- 2 . Clearly v h(J) = 0 and 
T( R ) = U, as desired. This proves Proposition |13.12| . 


Corollary 13.14 The dimension of An is equal to the square of the dimen¬ 
sion of A = Y divided by the dimension ofT, as vector spaces over k. 












Compare with Lemma and note that the dimension of F is equal to 
the dimension of r 0 . 

Let us now relate this to the set-up in Section [L2]. What were the vector 
spaces V, W before are now both taken to be V. Thus we write H for the 
vector space of linear mappings from V to itself. The algebra of operators A 
on V is now used for both A and B in Section [12|. Let Ah, i, Ah, 2 , and the 
combined algebra C be the algebras of operators defined on H in Section [12 
(with B = A). 

We can view Y = A as a vector subspace of H. As such, it is an invariant 
subspace for Ah,i, Ah, 2 , and the combined algebra C. As in Notation |6A| , 
Ah,i(Y), Ah, 2 00, and C(Y) denote the algebras of operators on Y which oc¬ 
cur as restrictions of operators in Ah, 1 , Ah, 2 , and C to the invariant subspace 
Y. It is easy to check that 


(13.15) A\ = A h ,i{Y), A 2 = A h ,2(Y), Au = C(Y). 


Just as for A , we can associate to T the algebras of linear operators T#i, 
T h ,2 on H of the form 

(13.16) R > RoT, R^ToR, 

respectively, where T lies in T. Thus 


(13.17) 


H, 1 


c A 


H, 1, 


F h;i C Ah, 2 , 


and Y is an invariant subspace for Fh. 1 , Fh, 2 - For T in T, the linear operators 
in ( |13. 16 ) are different as operators on H unless T is a scalar multiple of the 
identity, but the restrictions of these operators to Y are the same, since T is 
the center of A. In particular, 


(13.18) r Hll (y) = r„, 2 (Y) = r 0 . 

Let Ti,..., be a collection of operators in A which are T 0 -independent 
and whose T 0 -span is all of A, as discussed in Section [^, where we think of 
T 0 ~ T as a division algebra of operators on A. In fact, we can view A as 
a vector space over the held T, since T is commutative and commutes with 
all elements of A. In these terms T\,, X) is a basis for A as a the vector 
space over T. If 71 ,..., is a basis for T as a vector space over k, then the 
family of products 7 * o Tj, 1 < i < m, 1 < j < £, is a basis for A as a vector 
space over k. 









As in the proof of Lemma 12.19 , the operators on H given by 

(13.19) R !-»■ 7i 2 o T h oRo^o T ni 1 < i u i 2 <m, 1 < j u j 2 < A 

form a basis for the combined algebra C. When one restricts to Y, as with 
C(Y), then it is enough to consider the operators of the form 


(13.20) 


R ^ 7* ° T j2 o R o T n , 1 < i < m, 1 < ji, j 2 < l, 


since the yds commute with R EY and the T)’s. In other words, on Y, the 
span of these operators is the same as the span of the previous ones. This 
fits with Corollary 113. 14L which implies that m t 2 is equal to the dimension 
of C(Y) = A 12 . 


14 Some basic situations 

14.1 Some algebras of operators 

Let A; be a field, and let V be a vector space over k. Suppose that Vi,... ,Ve 
are vector subspaces of V such that 

(14.1) { 0 } C W C • • • C V t C V, 

where each inclusion is strict. Consider the algebra of operators A on V 
consisting of the linear mappings T : V —> V such that 

(14.2) T(Vj)CVj, 1 <j<£. 

Lemma 14.3 Under the conditions above, if Z and W are arbitrary vector 
subspaces ofV, then there are linear operators T\ , T- 2 in A such that Ti(V ) = 
W and the kernel ofT 2 is Z. 

To be more precise, one can choose T\ so that it satisfies 

(14.4) T 1 {vf)=w HVj, 1 <j<e, 

in addition to Ti(V ) = W. This is not hard to arrange, using a suitable basis 
for V. Notice that this is not the only possibility, however; for instance, if 
the dimension of W is less than or equal to the codimension of Vi in V, then 
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one can choose T\ so that 7\ is equal to 0 on Vg, but maps a subspace of V 
whose intersection with Vg is trivial onto W . As for T 2 , we do need to have 


(14.5) 


{v G Vj : T 2 (v) = 0} = Z fl Vj, 1 < j < l. 


in order for the kernel of T 2 to be equal to Z, and again this is not hard to 
arrange, through suitable bases. 

Proposition 14.6 Under the conditions above, the commutant A 1 of A con¬ 
sists of scalar multiples of the identity operator on V. 

Of course A! always contains the scalar multiples of the identity. Con¬ 
versely, suppose that S lies in A'. Because of Lemmas |14.3| and | 2 .H| . if U is 
any vector subspace of V, then S(U) C U. (For this one might as well even 
consider only 1-dimensional subspaces U of V.) From this it is not hard to 
see that S must be a scalar multiple of the identity operator. 

14.2 A construction 

Let k be a held, let V be a vector space, and let V* denote the dual space 
of V, i.e., the vector space of linear mappings from V into k. If T is a linear 
mapping from V to itself, then there is an associated dual linear operator T 
on V*, which is defined by saying that if f) is a linear functional on V, then 
T(<f>) is the linear functional given by T((f)(v) = <f>{T{y)). The transformation 
from a linear operator T on V to the dual linear operator T is linear in T, 
and satisfies (Tf o T 2 ) = T 2 o T\ for all linear operators T\ , T 2 on V. The 
dual of the identity operator on V is equal to the identity operator on V*, 
and the dual of an invertible operator on V is an invertible operator on V*, 
with the inverse of the dual being the dual of the inverse. There is a natural 
identification of V with the second dual space V**, and the dual of the dual 
of an operator T on V coincides with T under this identification. 

Assume now that A is an algebra of linear operators on V. We can define 
A to be the algebra of linear operators on the dual space V* consisting of 
the duals of the linear operators in A. 

Let W be the vector space which is the direct sum of V and V*. Thus W 
consists of ordered pairs with u in V and (j) in V *, with addition and 

scalar multiplication of vectors defined coordinatewise. Define A to be the 
collection of operators on W of the form 


(14.7) 


(u» ^ (S(u),T(4>)), 
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where S, T lie in A. It is not hard to see that A is an algebra of operators 
on W. The subspaces V x {0} and {0} x V* of W are complementary to 
each other and invariant under A, and, in effect, the restriction of A to these 
subspaces reduces to A and A, respectively. 

Define bilinear forms B \, B 2 on W by 

(14.8) B 1 ((u, 0), (v, 0)) = 0(u) + 4>{v) 
and 

(14.9) B 2 {iu,(f>), (u,0)) =0(u)-0(u). 

Notice that B\ is a symmetric bilinear form while Bo is antisymmetric, i.e., 

(14.10) #i((u,0),(v,0)) = #i((u,0), (u,0)), 

5 2 ((m, 0), (u,0)) = -# 2 ((o0), (u,0)). 

Both are nondegenerate, which is to say that for i = 1 or 2 and for any 
nonzero element (u, 0) of W there is a (v, 0) in W such that 

(14.11) Bi({u,(j)), (v,0)) 0 0 . 

On the other hand, it is not true that B\ is definite, because there are plenty 
of nonzero elements (u, 0) of W which satisfy 


5i((u,0), (u,0)) = 0. 
5 2 ((m,0), (u,0)) = 0 


(14.12) 

For I ?2 we have that 

(14.13) 

for all (u, 0) in ID, which is in fact equivalent to the antisymmetry of B 2 . 

Suppose that S, T are elements of A, and that R is the element of A 
given by R(u, 0) = (S(u), T(0)), as in ( |14.7| ). Observe that 

(14.14) #! (r(u, 0), {v, 0 )) = 0(S'(u)) + 0 (T(u)) = #i((u, 0 ), #*(u, 
where i ? 4 is the element of A defined by 


(14.15) i0(u,0) = (2»,S(0))- 


Similarly, 

(14.16) B 2 (R(u, 0), (u,0)) = 0(S(w)) - 0(7») = # 2 ((n,0),#0u,0)). 


92 





To put it another way, R* is the “transpose” of R as a linear mapping 
from W to itself with respect to each of the bilinear forms Bi and B 2 . Thus 
we see that A contains the transposes of all of its elements, which is to say 
that it is a “f-algebra” of operators on W, in a sense analogous to that of 
Definition |3.11| , considered before in the setting of definite scalar products. 

Let us specialize to the case where A is as in Subsection |14.1| , correspond¬ 
ing to the subspaces Vi,..., Vi of V. For any vector subspace U of V, we 
denote by U ± the subspace of V* consisting of linear functionals 0 on V such 
that 0(w) = 0 for all u G U. A standard observation from linear algebra is 
that a vector v in V lies in U if and only if (j)(v) = 0 for all (j) in U L . From 
(114.10 we get that 


(14.17) 


{0JC7/C-C Vf C V* 


with the inclusions again being strict since they were strict before. One can 
check that a linear operator on V* lies in A in these circumstances if and 
only if each subspace VA, 1 < j < £, of V* is invariant under the operator. 

One can also verify that the commutant of A in C{W) consists of linear 
combinations of the coordinate projections from W ~ V x V* onto V x {0} 
and {0} x V*. 


15 Positive characteristic 

Throughout this section, we let p be a prime number, and we let k be a held 
of characteristic p. 

15.1 A few basic facts 

The most basic example of a held with characteristic p is Z/pZ, the held 
with p elements obtained by taking the integers Z modulo p. Any held with 
characteristic p contains a copy of this held in a canonical way, as the set of 
elements obtained by taking sums of the multiplicative identity element. In 
particular, any such held can be viewed as a vector space over Z/pZ, and if 
a held of characteristic p has a finite number of elements, then that number 
is a power of p. A well-known result is that for each positive integer m there 
is a held of characteristic p with p m elements, and this held is unique up to 
isomorphism. 
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In a field k of characteristic p we have that 


(15.1) 


(x + y) p = x p + y v 


for all x , y in k. Indeed, 

(15.2) (x + y) p = jr ( P \ x j y p ~ j = x p + y p , 

3=0 \ 3 / 


where the first step is an instance of the binomial theorem and the second 
step uses the observation that the binomial coefficient (J*j is a positive integer 
which is divisible by p when 1 < j < p — 1 , and hence gives 0 in the field k. 
Notice that 

(15.3) (x — y) p = x p — y p 


for all x, y in k. This follows from ( |15.1 ) and the fact that (—1) 
k. More precisely, (—l ) 2 = 1 = —1 when p = 2, while (—l) p = —1 
odd because (— 1 )“ = 1 when a is an even integer, such as p — 1 . 
By iterating these identities one obtains 


p = — 1 in 
when p is 


(15.4) 


(x + y) pl 


x p +y p 


l 


and 

(15.5) (x - y) pl = x pi - y pl 

for all positive integers l and all x, y in k. As a consequence of the latter we 
obtain the following. 


Lemma 15.6 If l is a positive integer and x is an element ofk which satisfies 
x p = 1, then x — 1. 


By contrast, there can be nontrivial roots of unity of other orders. If k is 
finite, with p m elements for some positive integer m, then the set of nonzero 
elements of k is a group under multiplication of order p m — 1. It is well known 
that this group is in fact cyclic. The identity 

(15.7) x p ™- 1 = 1 

for all nonzero elements x of k can be derived more simply, from the general 
result that the order of a group is divisible by the order of any subgroup, 
and hence by the order of any element of the group. Of course p m — 1 is not 
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divisible by p. Since p m — 1 is the number of nonzero elements in k when k 
has p m elements, we see that the polynomial x pm1 — 1 completely factors in 
k, i.e., it is the product of the p m — 1 linear polynomials x — a, where a runs 
through the set of nonzero elements of k. Alternatively, one can say that the 
polynomial x pm — x = x(x pm ~ 1 — 1) completely factors, as the product of the 
linear polynomials x — a, where now a runs through all elements of k. 


15.2 Representations 


Let V be a vector space over k, let G be a finite group, and let p be a 
representation of G on V. If p divides the order of G, then the argument 
described in Subsection [1.5| for finding invariant complements of invariant 
subspaces of a representation does not work, and indeed the result is not 
true. 

If PL is a subspace of V which is invariant under p, then one can at least 
consider the quotient vector space V/W, and the representation of G on V/W 
obtained from p in the obvious manner. Recall that V/W is defined using 
the equivalence relation ~ on V given by 


(15.8) 


V\ ~ V'2 if and only if V\ — v 2 € W, 


by passing to the corresponding equivalence classes. The assumption that W 
is invariant under p implies that this equivalence relation is preserved by p, 
so that p leads to a representation on the quotient space V/W. 

Thus if the representation p on V is not irreducible, so that there is an 
invariant subspace W which is neither the zero subspace nor V, then we 
get two representations of smaller positive degree, namely the restriction of 
p to W and the representation on V/W obtained from p. The sum of the 
degrees of these two new representations is equal to the degree of the original 
representation, i.e., the sum of the dimensions of W and V/W is equal to 
the dimension of V. We can repeat the process, of passing to subspaces and 
quotients, to get a family of irreducible representations, where the sum of the 
degrees of these representations is equal to the degree of p (the dimension of 
V). However, because we do not necessarily have direct sum decompositions 
at each step, the family of irreducible representations may not contain as 
much information as the original representation. 

Let F{G) denote the vector space of fc-valued functions on G. As in 
Subsection |1.3| , one has the left and right regular representations L. R. of 
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G on F(G ), and these two representations are isomorphic via the simple 
correspondence /(a) /(a -1 ) for functions on G. Note that the regular 

representations have the feature that elements of G different from the identity 
element correspond to linear transformations on F(G) which are different 
from the identity transformation. In other words, these representations give 
rise to isomorphisms between G and subgroups of the group of all invertible 
linear transformations on F(G). 


Lemma 15.9 Let Z be a vector space over k, a a representation of G on Z, 
and A a nonzero linear mapping from Z to k. For each v in Z, define f v (y) 
on G by 

(15.10) f v (y) = X(a y -i(v)), 
and put 

(15.11) U = {f v (y) : v G Z}. 

The mapping 

(15.12) v ^ f v 

is a nonzero linear mapping from Z onto U, and U is a nonzero vector 
subspace of F(G). This mapping intertwines the representations a on Z and 
the restriction of the left regular representation to U, and U is invariant 
under the left regular representation. If a is an irreducible representation of 
G, then ( \15.1Q ) is one-to-one and yields an isomorphism between a and the 
restriction of the left regular representation to U. 


This is essentially the same as Lemma |7.7| , and the proof is the same as 
before. 


15.3 Linear transformations 

Let V be a vector space over k. Suppose that R and S are two linear 
transformations on V which commute. Once again we have that 

(15.13) (R + S) p = R p + S p . 

This can be established in the same manner as before, by expanding ( R + S) p 
using the binomial theorem, and then observing that the intermediate terms 
vanish because k has characteristic p. We also get 

(15.14) ( R - S) p = R P - S p , 


96 





and, for any positive integer l, 


(15.15) (R + S) pl = R pl + S p ‘ 
and 

(15.16) (R-S) pl = R pl -S pl . 

Now assume that T is a linear transformation on V such that T p ‘ = / for 
some positive integer l, where / denotes the identity transformation on V. 
Since T and / automatically commute, we get that 

(15.17) (T - If 1 =T pl -I = 0. 

Thus T — I is nilpotent of order (at most) p l . 

If n is a positive integer and S' is a linear transformation on V such that 

(15.18) S n = /, 

then there is a well-known way to factor S in order to separate the factors of 
p in n from the rest. We begin by writing n as p l m, where l is a nonnegative 
integer and m is a positive integer which is not divisible by p. Because p l 
and m are relatively prime, it is a standard result that there are integers a, 
b such that 

(15.19) am + bp 1 = 1. 

Thus we can write 

(15.20) S = Si S‘ 2 , where Si = S an \ S 2 = S bp ‘, 

and where it is understood that S c is interpreted as being the identity trans¬ 
formation when c = 0. With these choices we have that 

(15.21) Sf = /, S 2 m = /, and Si S 2 = S 2 S v 
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